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Sequence Listing 



[1) GENERAL INFpRMATION: 



(i) APPLICAJSIT: Chiron Corporation 

\ 

[ii) TITLE OF^ INVENTION: Methods for Administration of 

Recombinant Gene Delivery Vehicles for Treatment of Hemophilia 
and Other Eiisorders 



(iii) 
(iv) 



(v) 



(B) 
(C) 
(D) 
(E) 
(F) 



STREET 
CITY: 
STATE : 




NUMBER OF \SEQUENCES : 83 



CORRESPOND 
(A) ADDRE 



CE ADDRESS: 

EE: Chiron Corporation 
4560 Horton Street 
leryville 
alif ornia 
COUNTRY i U.S.A. 

ZIP: 94^8 

\ 



COMPUTER READ^^LE FORM: 

(A) MEDIUM TY^E: Floppy disk 

(B) COMPUTER: iIBM PC compatible 

(C) OPERATING ^YSTEM : PC-DOS /MS-DOS 

(D) SOFTWARE: ^tentln Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: \ 

(C) CLASSIFICATION^: 

' . . . \ 
(viii) ATTORNEY /AGENT INFOl^LMATION : 

(A) NAME: Kruse, Notrman J. 

(B) REGISTRATION NUMBER: 3 5,23 5 

(C) REFERENCE /DOCKE'A NUMBER: 1155.005 

\ 

(ix) TELECOMMUNICATION INFdtf^MATION : 

(A) TELEPHONE: (510) '&23-3520 

(B) TELEFAX: (510) 65^.-3542 



(2) INFORMATION FOR SEQ ID NO : 1 : \ 
(i) SEQUENCE CHARACTERISTICS:; 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ( genomic ^ 
(xi) SEQUENCE DESCRIPTION: SEQ 10 NO : 1 : 
GAGAGATGGG GGAGGCTAAC TGAG 



24 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



{dV TOPOLOGY: linear 
(ii) MOLEpULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

GATCCTCAGT TAGtCTCCCC CATCTCTC 2 8 

\ 

(2) INFORMATION^, FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TATATCTCCA GATGAGGTAC ATGATTTTAG GCTTG 3 5 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE.: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
TATATATCGA TTCAAGGCAT TTTCTTTTCA TCAATAAAAC 4 0 

(2) INFORMATION FOR SEQ '"^D NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: liAear 

(ii) MOLECULE TYPE; DNA. (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TCGAGGATCC GCCCGGGCGG CCGCATCGAT GTCGACG 3 7 

(2) INFORMATION FOR SEQ ID Nd: 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCGTCGACA TCGATGCGGC CGCCCGGGCG GATCC 3 5 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AGTGAATTCG AGCTCGGTAC CCGGGGATCC TCTAGAGTCG ACCTGCAGGC ATGCAAGCTT 60 
GGCGTAATCA TGGTCAT 7 7 

\ 



(2) INFORMAtVoN for SEQ id NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) \^ LENGTH; 8 amino acids 

(B) \TYPE: amino acid 

(C) 'STRANDEDNESS : single 

(D) ^OPOLOGY: linear 
(ii) MOLECtlLE TYPE: protein 

(xi) SEQUEr^CE DESCRIPTION: SEQ ID NO : 8 
Ala Arg Glju Met Gly Glu Ala Asn 

(2) INFORMATION IFOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYi»E: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOTOLOGY: linear 

(ii) MOLECULeI TYPE : DNA (genomic) 
(xi) sequence! DESCRIPTION: SEQ ID NO : 9 
CCCGAGAGAT GGGGGAQGCT AACTGAG 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE (JHAFIACTERISTICS : 

(A) LENG-:^: 31 base pairs 

(B) TYPE : \ nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOpY: linear 

(ii) MOLECULE TYPE- DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
GGGCTCTCTA CCCCCTCCGi^ TTGACACCTA G 

(2) INFORMATION FOR S^Q ID NO : 11 : 
(i) SEQUENCE CHAI^CTERISTICS : 

(A) LENGTH: B amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNlESS : single 

(D) TOPOLOGY :\ linear 
^ (ii) MOLECULE TYPE: \protein 

■\ (xi) SEQUENCE DESCRI^PTION : SEQ ID NO : 1 
/ Thr He Met Thr Met 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
CCCTGTGCCT TATTTGAACT AACC 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDBDNESS : single 
CD) TOPOLOgV: linear 
(ii) MOLECULE TYP^ : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCCACCACAA CCACATATCC tTCC 24 

\ 

(2) INFORMATION FOR SEq\ ID N0:14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nuclWic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: iVnear 

(ii) MOLECULE TYPE: Dl^A (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCAGTCCTCC GATTGACTG \, 19 

(2) INFORMATION FOR SEQ IdVj0:15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8332 base pairs 

(B) TYPE: nucleic kcid 

(C) STRANDEDNESS: Aingle 

(D) TOPOLOGY: lineaV 
(ii) MOLECULE TYPE: DNA {genomic) 
(xi) SEQUENCE DESCRIPTION :, SEQ ID NO: 15: 

GCGCCAGTCC TCCGATTGAC TGAGTCGCCp GGGTACCCGT GTATCCAATA AACCCTCTTG 60 

CAGTTGCATC CGACTTGTGG TCTCGCTGTT' CCTTGGGAGG GTCTCCTCTG AGTGATTGAC 12 0 

TACCCGTCAG CGGGGGTCTT TCATTTGGGG ^CTCGTCCGG GATCGGGAGA CCCCTGCCCA IBO 

GGGACCACCG ACCCACCACC GGGAGGTAAG cVgGCCAGCA ACTTATCTGT GTCTGTCCGA 240 

TTGTCTAGTG TCTATGACTG ATTTTATGCG CC^GCGTCGG TACTAGTTAG CTAACTAGCT 3 00 

CTGTATCTGG CGGACCCGTG GTGGAACTGA CGAi^TTCGGA ACACCCGGCC GCAACCCTGG 3 60 

GAGACGTCCC AGGGACTTCG GGGGCCGTTT TTGt\5GCCCG ACCTGAGTCC AAAAATCCCG 420 

ATCGTTTTGG ACTCTTTGGT GCACCCCCCT TAGAC^AGGG ATATGTGGTT CTGGTAGGAG 480 

ACGAGAACCT AAAACAGTTC CCGCCTCCGT CTGAAT^TTT GCTTTCGGTT TGGGACCGAA 540 

GCCGCGCCGC GCGTCTTGTC TGCTGCAGCA TCGTTCTY^TG TTGTCTCTGT CTGACTGTGT 600 

TTCTGTATTT GTCTGAGAAT ATGGGCCAGA CTGTTACCyi.C TCCCTTAAGT TTGACCTTAG 6 60 

GTCACTGGAA AGATGTCGAG CGGATCGCTC ACAACCAG']\C GGTAGATGTC AAGAAGAGAC 72 0 

GTTGGGTTAC CTTCTGCTCT GCAGAATGGC CAACCTTTa\ CGTCGGATGG CCGCGAGACG 780 

\ 

GCACCTTTAA CCGAGACCTC ATCACCCAGG TTAAGATCAA^^GGTCTTTTCA CCTGGCCCGC 840 

ATGGACACCC AGACCAGGTC CCCTACATCG TGACCTGGGA J^GCCTTGGCT TTTGACCCCC 900 

CTCCCTGGGT CAAGCCCTTT GTACACCCTA AGCCTCCGCC itcTCTTCCT CCATCCGCCC 9 60 



CGTCTCTCCC CCTTGAACCIT CCTCGTTCGA CCCCGCCTCG ATCCTCCCTT TATCCAGCCC 102 0 

TCACTCCTTC TCTAGGCGCC AAACCTTV^AC CTCAAGTTCT TTCTGACAGT GGGGGGCCGC lOBO 

TCATCGACCT ACTTACAGAa\ GACCCCCCGC CTTATAGGGA CCCAAGACCA CCCCCTTCCG 114 0 

ACAGGGACGG AAATGGTGGA iSAAGCGACCC CTGCGGGAGA GGCACCGGAC CCCTCCCCAA 12 00 

TGGCATCTCG CCTACGTGGG AffiACGGGAGC CCCCTGTGGC CGACTCCACT ACCTCGCAGG 12 60 

CATTCCCCCT CCGCGCAGGA GGAAACGGAC AGCTTCAATA CTGGCCGTTC TCCTCTTCTG 1320 

ACCTTTACAA CTGGAAAAAT AATAACCCTT CTTTTTCTGA AGATCCAGGT AAACTGACAG 13 80 

CTCTGATCGA GTCTGTTCTC ATCaJvCCATC AGCCCACCTG GGACGACTGT CAGCAGCTGT 1440 

TGGGGACTCT GCTGACCGGA GAAGaJ^AAAC AACGGGTGCT CTTAGAGGCT AGAAAGGCGG 15 00 

TGCGGGGCGA TGATGGGCGC CCCACT(iAAC TGCCCAATGA AGTCGATGCC GCTTTTCCCC 15 60 

\ 

[ 

TCGAGCGCCC AGACTGGGAT TACACCACCC AGGCAGGTAG GAACCACCTA GTCCACTATC 162 0 

GCCAGTTGCT CCTAGCGGGT CTCCAAAAC^ CGGGCAGAAG CCCCACCAAT TTGGCCAAGG 1680 

TAAAAGGAAT AACACAAGGG CCCAATGAGtVtCCCTCGGC CTTCCTAGAG AGACTTAAGG 17 4 0 

\ 

AAGCCTATCG CAGGTACACT CCTTATGACC AtGAGGACCC AGGGCAAGAA ACTAATGTGT IHOO 

\ 

CTATGTCTTT CATTTGGCAG TCTGCCCCAG ACATTGGGAG AAAGTTAGAG AGGTTAGAAG 1860 

\ 

ATTTAAAAAA CAAGACGCTT GGAGATTTGG TTA^AGAGGC AGAAAAGATC TTTAATAAAC 192 0 

GAGAAACCCC GGAAGAAAGA GAGGAACGTA TCAG^AGAGA AACAGAGGAA AAAGAAGAAC 19 80 

GCCGTAGGAC AGAGGATGAG CAGAAAGAGA AAGAAAGAGA TCGTAGGAGA CATAGAGAGA 2040 

TGAGCAAGCT ATTGGCCACT GTCGTTAGTG GACAGAAACA GGATAGACAG GGAGGAGAAC 2100 

GAAGGAGGTC CCAACTCGAT CGCGACCAGT GTGCCTAffTG CAAAGAAAAG GGGCACTGGG 2160 

CTAAAGATTG TCCCAAGAAA CCACGAGGAC CTCGGGGAffC AAGACCCCAG ACCTCCCTCC 2220 

TGACCCTAGA TGACTAGGGA GGTCAGGGTC AGGAGCCCCff CCCTGAACCC AGGATAACCC 22 80 

TCAAAGTCGG GGGGCAACCC GTCACCTTCC TGGTAGATAc\tGGGGCCCAA CACTCCGTGC 23 40 

TGACCCAAAA TCCTGGACCC CTAAGTGATA AGTCTGCCTG qGTCCAAGGG GCTACTGGAG 2 400 

GAAAGCGGTA TCGCTGGACC ACGGATCGCA AAGTACATCT AckrTACCGGT AAGGTCACCC 2460 

ACTCTTTCCT CCATGTACCA GACTGTCCCT ATCCTCTGTT AGQAAGAGAT TTGCTGACTA 2 52 0 

AACTAAAAGC CCAAATCCAC TTTGAGGGAT CAGGAGCTCA GGTTATGGGA CCAATGGGGC 2 5 80 

AGCCCCTGCA AGTGTTGACC CTAAATATAG AAGATGAGCA TCGGOTACAT GAGACCTCAA 2 640 

AAGAGCCAGA TGTTTCTCTA GGGTCCACAT GGCTGTCTGA TTTTCQTCAG GCCTGGGCGG 27 00 



AAACCGGGGG CATGQGACTG GCAGTTCGCC AAGCTCCTCT GATCATACCT CTGAAAGCAA 2 760 

CCTCTACCCC CGTGTCCATA AAACAATACC CCATGTCACA AGAAGCCAGA CTGGGGATCA 282 0 

'\ 

AGCCCCACAT ACAGAGA^TG TTGGACCAGG GAATACTGGT ACCCTGCCAG TCCCCCTGGA 2 8 80 

\ 

ACACGCCCCT GCTACCCG^T AAGAAACCAG GGACTAATGA TTATAGGCCT GTCCAGGATC 2 940 

TGAGAGAAGT CAACAAGCGG GTGGAAGACA TCCACCCCAC CGTGCCCAAC CCTTACAACC 3 000 

TCTTGAGCGG GCTCCCACCg\ TCCCACCAGT GGTACACTGT GCTTGATTTA AAGGATGCCT 3 060 

TTTTCTGCCT GAGACTCCAC CCCACCAGTC AGCCTCTCTT CGCCTTTGAG TGGAGAGATC 312 0 

CAGAGATGGG AATCTCAGGA CRATTGACCT GGACCAGACT CCCACAGGGT TTCAAAAACA 3180 

GTCCCACCCT GTTTGATGAG GckcTGCACA GAGACCTAGC AGACTTCCGG ATCCAGCACC 3 240 

CAGACTTGAT CCTGCTACAG TAC^TGGATG ACTTACTGCT GGCCGCCACT TCTGAGCTAG 3 3 00 

ACTGCCAACA AGGTACTCGG GCCCTGTTAC AAACCCTAGG GAACCTCGGG TATCGGGCCT 3 3 60 

CGGCCAAGAA AGCCCAAATT TGCCAgVaAC AGGTCAAGTA TCTGGGGTAT CTTCTAAAAG 3 42 0 

AGGGTCAGAG ATGGCTGACT GAGGCCA^AA AAGAGACTGT GATGGGGCAG CCTACTCCGA 3 4 80 

AGACCCCTCG ACAACTAAGG GAGTTCCTAG GGACGGCAGG CTTCTGTCGC CTCTGGATCC 3 54 0 

CTGGGTTTGC AGAAATGGCA GCCCCCTTGIA ACCCTCTCAC CAAAACGGGG ACTCTGTTTA 3 6 00 

ATTGGGGCCC AGACCAACAA AAGGCCTATC AAGAAATCAA GCAAGCTCTT CTAACTGCCC 3 660 

CAGCCCTGGG GTTGCCAGAT TTGACTAAGC CGTTTGAACT CTTTGTCGAC GAGAAGCAGG 3 72 0 

GCTACGCCAA AGGTGTCCTA ACGCAAAAAC TGC^ACCTTG GCGTCGGCCG GTGGCCTACC 3 7 80 

\ 

TGTCCAAAAA GCTAGACCCA GTAGCAGCTG GGTGpCCCCC TTGCCTACGG ATGGTAGCAG 3 840 



rA^c 



CCATTGCCGT ACTGACAAAG GATGCAGGCA AGCTA^CCAT GGGACAGCCA CTAGTCATTC 3 900 

TGGCCCCCCA TGCAGTAGAG GCACTAGTCA AACAAcApCC CGACCGCTGG CTTTCCAACG 3 960 

CCCGGATGAC TCACTATCAG GCCTTGCTTT TGGACACG^A CCGGGTCCAG TTCGGACCGG 4 02 0 

\ 

TGGTAGCCCT GAACCCGGCT ACGCTGCTCC CACTGCCTCil^ GGAAGGGCTG CAACACAACT 4080 

\^ 

GCCTTGATAT CCTGGCCGAA GCCCACGGAA CCCGACCCGA ■ CCTAACGGAC CAGCCGCTCC 4140 

'\ 

\ 

CAGACGCCGA CCACACCTGG TACACGGATG GAAGCAGTCT CTTACAAGAG GGACAGCGTA 4 2 00 

AGGCGGGAGC TGCGGTGACC ACCGAGACCG AGGTAATCTG cdfTAAAGCC CTGCCAGCCG 42 60 

GGACATCCGC TCAGCGGGCT GAACTGATAG CACTCACCCA GGCC^CTAAAG ATGGCAGAAG 43 2 0 

gtaagaagct aaatgtttat actgatagcc gttatgcttt tgctActgcc CATATCCATG 43 8 0 



GAGAAATATA CAG^GGCGT GGGTTGCTCA CATCAGAAGG CAAAGAGATC AAAAATAAAG 444 0 

ACGAGATCTT GGCCQTACTA AAAGCCCTCT TTCTGCCCAA AAGACTTAGC ATAATCCATT 4 500 

GTCCAGGACA TCAAAAGGGA CACAGCGCCG AGGCTAGAGG CAACCGGATG GCTGACCAAG 4 560 

CGGCCCGAAA GGCAGCCA^TC ACAGAGACTC CAGACACCTC TACCCTCCTC ATAGAAAATT 4 620 

CATCACCCTA CACCTCAGAf^ CATTTTCATT ACACAGTGAC TGATATAAAG GACCTAACCA 4 680 

AGTTGGGGGC CATTTATGAt\ AAAACAAAGA AGTATTGGGT CTACCAAGGA AAACCTGTGA 4740 

TGCCTGACCA GTTTACTTTT J^TTATTAG ACTTTCTTCA TCAGCTGACT CACCTCAGCT 4 800 

TCTCAAAAAT GAAGGCTCTC CTAGAGAGAA GCCACAGTCC CTACTACATG CTGAACCGGG 4 860 

ATCGAACACT CAAAAATATC ACT(^AGACCT GCAAAGCTTG TGCACAAGTC AACGCCAGCA 4 920 

AGTCTGCCGT TAAACAGGGA ACTAC^GTCC GCGGGCATCG GCCCGGCACT CATTGGGAGA 4 9R0 

TCGATTTCAC CGAGATAAAG CCCGGATTGT ATGGCTATAA ATATCTTCTA GTTTTTATAG 5 04 0 

ATACCTTTTC TGGCTGGATA GAAGCCTl^C CAACCAAGAA AGAAACCGCC AAGGTCGTAA 5100 

CCAAGAAGCT ACTAGAGGAG ATCTTCCCC}\^ GGTTCGGCAT GCCTCAGGTA TTGGGAACTG 5160 

ACAATGGGCC TGCCTTCGTC TCCAAGGTGA \5TCAGACAGT GGCCGATCTG TTGGGGATTG 5 22 0 

ATTGGAAATT ACATTGTGCA TACAGACCCC AJy^GCTCAGG CCAGGTAGAA AGAATGAATA 5 2R0 

GAACCATCAA GGAGACTTTA ACTAAATTAA CG^TGCAAC TGGCTCTAGA GACTGGGTGC 53 4 0 

\ 

TCCTACTCCC CTTAGCCCTG TACCGAGCCC GCAA(^CGCC GGGCCCCCAT GGCCTCACCC 54 00 

CATATGAGAT CTTATATGGG GCACCCCCGC CCCTTG'^AAA CTTCCCTGAC CCTGACATGA 5460 

CAAGAGTTAC TAACAGCCCC TCTCTCCAAG CTCACTtA^A GGCTCTCTAC TTAGTCCAGC 5 520 

ACGAAGTCTG GAGACCTCTG GCGGCAGCCT ACCAAGAACA ACTGGACCGA CCGGTGGTAC 5580 

CTCACCCTTA CCGAGTCGGC GACACAGTGT GGGTCCGCCG ^VCACCAGACT AAGAACCTAG 5640 



AACCTCGCTG GAAAGGACCT TACACAGTCC TGCTGACCAC CC\CCACCGCC CTCAAAGTAG 57 0 0 

v 

\ 

ACGGCATCGC AGCTTGGATA CACGCCGCCC ACGTGAAGGC TGC^GACCCC GGGGGTGGAC 5760 

CATCCTCTAG ACTGACATGG CGCGTTCAAC GCTCTCAAAA CCCCTTAAAA ATAAGGTTAA 5 820 

CCCGCGAGGC CCCCTAATCC CCTTAATTCT TCTGATGCTC AGAGGG(3TCA GTACTGCTTC 5 880 

\ 

\ 

GCCCGGCTCC AGTCCTCATC AAGTCTATAA TATCACCTGG GAGGTAA^SCA ATGGAGATCG 594 0 

GGAGACGGTA TGGGCAACTT CTGGCAACCA CCCTCTGTGG ACCTGGTGCk: CTGACCTTAC 600 0 

CCCAGATTTA TGTATGTTAG CCCACCATGG ACCATCTTAT TGGGGGCTAGyAATATCAATC 6 060 

CCCTTTTTCT TCTCCCCCGG GGCCCCCTTG TTGCTCAGGG GGCAGCAGCC OAGGCTGTTC 612 0 



CAGAGACTGC GAAGAXcCTT TAACCTCCCT CACCCCTCGG TGCAACACTG CCTGGAACAG 6180 

ACTCAAGCTA GACCAGACAA CTCATAAATC AAATGAGGGA TTTTATGTTT GCCCCGGGCC 6240 

CCACCGCCCC CGAGAATCCA AGTCATGTGG GGGTCCAGAC TCCTTCTACT GTGCCTATTG 63 00 

GGGCTGTGAG ACAACCGGTA GAGCTTACTG GAAGCCCTCC TCATCATGGG ATTTCATCAC 63 60 

AGTAAACAAC AATCTCACCT VtGACCAGGC TGTCCAGGTA TGCAAAGATA ATAAGTGGTG 642 0 

CAACCCCTTA GTTATTCGGT TTACAGACGC CGGGAGACGG GTTACTTCCT GGACCACAGG 6480 

ACATTACTGG GGCTTACGTT TgVaTGTCTC CGGACAAGAT CCAGGGCTTA CATTTGGGAT 6540 

CCGACTCAGA TACCAAAATC TAGffiACCCCG CGTCCCAATA GGGCCAAACC CCGTTCTGGC 6600 

AGACCAACAG CCACTCTCCA AGCCOAAACC TGTTAAGTCG CCTTCAGTCA CCAAACCACC 6660 

CAGTGGGACT CCTCTCTCCC CTACCcWcT TCCACCGGCG GGAACGGAAA ATAGGCTGCT 672 0 

\ 

AAACTTAGTA GACGGAGCCT ACCAAGC^iCT CAACCTCACC AGTCCTGACA AAACCCAAGA 67 8 0 

gtgctggttg tgtctagtag cgggaccc\c CTACTACGAA GGGGTTGCCG TCCTGGGTAC 6840 

CTACTCCAAC CATACCTCTG CTCCAGCCAA \^CTGCTCCGTG GCCTCCCAAC ACAAGTTGAC 6900 

CCTGTCCGAA GTGACCGGAC AGGGACTCTG d^TAGGAGCA GTTCCCAAAA CACATCAGGC 6960 

CCTATGTAAT ACCACCCAGA CAAGCAGTCG AGQGTCCTAT TATCTAGTTG CCCCTACAGG 7 02 0 

\ 

\ 

TACCATGTGG GCTTGTAGTA CCGGGCTTAC TCCATGCATC TCCACCACCA TACTGAACCT 7080 

\ 

TACCACTGAT TATTGTGTTC TTGTCGAACT CTGGCCAAGA GTCACCTATC ATTCCCCCAG 714 0 

V 

CTATGTTTAC GGCCTGTTTG AGAGATCCAA CCGACAC^AAA AGAGAACCGG TGTCGTTAAC 72 0 0 

CCTGGCCCTA TTATTGGGTG GACTAACCAT GGGGGGA/^T GCCGCTGGAA TAGGAACAGG 72 60 

GACTACTGCT CTAATGGCCA CTCAGCAATT CCAGCAGCT^ CAAGCCGCAG TACAGGATGA 73 20 

TCTCAGGGAG GTTGAAAAAT CAATCTCTAA CCTAGAAAAg\tCTCTCAi:TT CCCTGTCTGA 73 80 

AGTTGTCCTA CAGAATCGAA GGGGCCTAGA CTTGTTATTT AtAAAAGAAG GAGGGCTGTG 7 44 0 

TGCTGCTCTA AAAGAAGAAT GTTGCTTCTA TGCGGACCAC a\^GGACTAG TGAGAGACAG 7500 

catggccaaa ttgagagaga ggcttaatca gagacagaaa ctgWtgagt CAACTCAAGG 7560 

ATGGTTTGAG GGACTGTTTA ACAGATCCCC TTGGTTTACC ACCTTGATAT CTACCATTAT 7 62 0 

GGGACCCCTC ATTGTACTCC TAATGATTTT GCTCTTCGGA CCCTGqATTC TTAATCGATT 7680 

AGTCCAATTT GTTAAAGACA GGATATCAGT GGTCCAGGCT CTAGTTTffGA CTCAACAATA 7740 

TCACCAGCTG AAGCCTATAG AGTACGAGCC ATAGATAAAA TAAAAGAtVt TATTTAGTCT 7 800 



CCAGAAAAAG GGGGOAATGA AAGACCCCAC CTGTAGGTTT GGCAAGCTAG CTTAAGTAAC 7860 
GCCATTTTGC AAGGCATGGA AAAATACATA ACTGAGAATA GAGAAGTTCA GATCAAGGTC 7 92 0 

AGGAACAGAT GGAACA^^CTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 7980 



aV^ 



CCCCGGCTCA GGGCCAAGAA CAGATGGAAC AGCTGAATAT GGGCCAAACA GGATATCTGT 8040 

\ 

\ 

GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC AAGAACAGAT GGTCCCCAGA TGCGGTCCAG 8100 

\ 

CCCTCAGCAG TTTCTAGAg'^ ACCATCAGAT GTTTCCAGGG TGCCCCAAGG ACCTGAAATG 8160 

ACCCTGTGCC TTATTTGAA(i TAACCAATCA GTTCGCTTCT CGCTTCTGTT CGCGCGCTTC 822 0 

tgctccccga gctcaataaa Vgagcccaca ACCCCTCACT CGGGGCGCCA GTCCTCCGAT 82 80 

TGACTGAGTC GCCCGGGTAC ^ICGTGTATCC AATAAACCCT CTTGCAGTTG CA 83 32 



(2) INFORMATION FOR SEQ'. ID NO: 16: 
(i) SEQUENCE CHARAaTERISTICS : 

(A) LENGTH: 32\base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE^S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
(XI ) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGAGTGGTA ACAGTCTGGC CTTAA^^TCTC AG 32 

(2) INFORMATION FOR SEQ ID ]Sb:17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 basA pairs 

(B) TYPE: nucleic a^id 

(C) STRANDEDNESS : si^ngle 

(D) TOPOLOGY: linear\ 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: BEQ ID NO: 17: 
CGGTCGACCT CGAGAATTAA TTC \ 23 

\ (2) INFORMATION FOR SEQ ID NO: 18^ 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base paiVs 

(B) TYPE: nucleic acid ^ 

(C) STRANDEDNESS: singly 

(D) TOPOLOGY: linear \ 
(ii) MOLECULE TYPE: DNA (genom:\c) 
(xi) SEQUENCE DESCRIPTION: SEQ 'vID NO: 18: 

CTGGGAGACG TCCCAGGGAC TTC \ 23 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid \ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear \ 
(ii) MOLECULE TYPE: DNA (genomic) \ 
(xi) SEQUENCE DESCRIPTION: SEQ ID 1^0:19: 

GGCCAGACTG TTACCACTCC CTGAAGTTTG AC \ 3 2 




(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 
{A)\eNGTH: 3 0 base pairs 

(B) V^PE: nucleic acid 

(C) AtRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECUQE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CATCGATAAA ATAAAAGATT TTATTTAGTC 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE \CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPEA nucleic acid 

(C) STRANbEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE T'^^E : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CAAATGAAAG ACCCCCGCTG 

(2) INFORMATION FOR S^ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3^ base pairs 

(B) TYPE: nucieic acid 

(C) STRANDEDNEfiS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GAAGCTTCTC CCAGAACCCA CCAQTCTTGA AACGCCATC 

(2) INFORMATION FOR SEQ Id\n0:23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bate pairs 

(B) TYPE: nucleic \acid 

(C) STRANDEDNESS : feingle 

(D) TOPOLOGY: lineir 

(ii) MOLECULE TYPE: DNA (Wenomic) 
(xi) SEQUENCE DESCRIPTIOnV SEQ ID NO: 23 
TACCAGCTT TTGGTCTCAT CAAAG \ 

INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERIST KTS : 

(A) LENGTH: 3 6 base pkirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sin^e 

(D) TOPOLOGY; linear \ 
(ii) MOLECULE TYPE: DNA (genAmic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
CTCCTCGAGC TAAAGATATT TTAGAGAAGA ATTAAC 

{2} INFORMATION FOR SEQ ID NO: 25: \ 
(i) SEQUENCE CHARACTERISTICS:! 

(A) LENGTH: 25 base pairi 

(B) TYPE: nucleic acid \ 

(C) STRANDEDNESS: single \ 

(D) TOPOLOGY: linear \ 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TTCCTCTGGA CAGCIfGTCTA CTTTG 2 5 

(2) INFORMATION ^OR SEQ ID NO; 26: 
( i ) SEQUENCe\ CHARACTERISTICS : 

(A) LENGTH: 51 base pairs 

(B) TYP^: nucleic acid 

(C) STRA^DEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26: 
TCGAGGGGCC CAGATCTctc GCCGCTCGCG AGTCGACAAG CTTGGATCCA T 51 



(2) INFORMATION FOR ^EQ ID NO: 27: 
(i) SEQUENCE CHAfLACTERISTICS : 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY': linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGATGGATCC AAGCTTGTCG ACTCGCGAGC GGCCGCAGAT CTGGGCCCC 4 9 

(2) INFORMATION FOR SEQ tD NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES^,: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CACCGTCGTC GACTTATGCT 20 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base, pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: 'SEQ ID NO: 29: 
GACCGTCGAC TCAATTCTGG GAGAAGCTTC TTGG 3 4 

(2) INFORMATION FOR SEQ ID NO:3 0;: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singly 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CACCGTCGTC GACTTATGCT 20 

(2) INFORMATION FOR SEQ ID NO : 3 1 : 
(i) SEQUENCE CHARACTERISTICS: 



{ A) \ LENGTH: 3 0 base pairs 

(B) \typE: nucleic acid 

(C) BTRANDEDNESS: single 

(D) VoPOLOGY: linear 

{ii) MOLECliLE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
CAACGCTCGA GAAGCAGAAT CGCAAAAGGC 

(2) INFORMATION '(FOR SEQ ID NO: 32: 
(i) SEQUENCfe CHARACTERISTICS: 

(A) LEI^GTH: 3 2 base pairs 

(B) TYI*p: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE 'TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
TCGGCTCGAG GCATCAAGGG GAAATAACTC GT 



(2) INFORMATION FOR iSEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEpNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE'; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CCGACTCGAG TCAGTAGAGG TCCTGTGCCT C 

(2) INFORMATION FOR SEQ 'ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26\base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GCGACTCGAG CATGGGGCCC TGGGGC 



) INFORMATION FOR SEQ ID NO : 3 5 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 
GCACTGGAAT TCGTCAGGGC G 

(2) INFORMATION FOR SEQ ID NO : 3'6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 



CGCGCCGCTC GAGTCTACAA TGGCCTTGAC CTTTGCTTTA CTGG 44 

(2) INFORMATlW FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) ^LENGTH: 43 base pairs 

(B) VyPE: nucleic acid 

(C) iTRANDEDNESS: single 

(D) ibpOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GCGCCCATCG ATTCaVtCCT TACTTCTTAA ACTTTCTTGC AAG 43 



(2) INFORMATION ^OR SEQ ID NO : 3 8 : 
(i) SEQUENCE\ CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPH; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPODpGY: linear 

(ii) MOLECULE tVpE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CGCGCCGCTC GAGCATCCAA TGGCCCTGTC CTTTTCTTTA CTTATGG 47 



(2) INFORMATION FOR SSQ ID NO; 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nudleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: \linear 

(ii) MOLECULE TYPE: fc)NA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 3 9 

(2) INFORMATION FOR SEQ Id\nO:40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 bise pairs 

(B) TYPE: nucleic\acid 

(C) STRANDEDNESS : \single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA V genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CCGGATCCTC TACAATGGCC TTGACC^TTG CTTTACTGG 3 9 

(2) INFORMATION FOR SEQ ID N0\41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base Wirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (gehomic) 
(xi) SEQUENCE DESCRIPTION: £EQ ID NO: 41: 

CGCGCCGGCG GCCGCTCATT CCTTACTTCT TAAACTTTCT TGCAAG 46 



INFORMATION FOR SEQ ID NO:42t 
(i) SEQUENCE CHARACTERISTIC^ : 

(A) LENGTH: 42 base pai\rs 

(B) TYPE: nucleic acid ' 



\ 



(C) ETRANDEDNESS : single 

(D) ioPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENQE DESCRIPTION: SEQ ID NO: 42: 
CCGGATCCCA TCCAATGGCC CTGTCCTTTT CTTTACTTAT GG 4 2 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE \CHARACTER I ST ICS : 

(A) LENGTH: 46 base pairs 

(B) TYPE:\ nucleic acid 

(C) STRANPEDNESS: single 

(D) TOPOLciGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CGCGCCGGCG GCCGCTCAAt\ CCTTCCTCCT TAATCTTTTT TGCAAG 46 

(2) INFORMATION FOR SEO ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9D80 base pairs 

(B) TYPE: nucAeic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: Mnear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRI PT^tON : SEQ ID NO: 44: 

GGGGGGGGGG GGGGGGGGGG GGGTC^^AGCAC ATCCAGTGGG TAAAGTTCCT TAAAATGCTC 60 

TGCAAAGAAA TTGGGACTTT TCATTAAATC AGAAATTTTA CTTTTTTCCC CTCCTGGGAG 12 0 

CTAAAGATAT TTTAGAGAAG AATTAACCTT TTGCTTCTCC AGTTGAACAT TTGTAGCAAT lUO 

AAGTCATGCA AATAGAGCTC TCCACCTGCT TCTTTCTGTG CCTTTTGCGA TTCTGCTTTA 240 

GTGCCACCAG AAGATACTAC CTGGGTGCAG TGGAACTGTC ATGGGACTAT ATGCAAAGTG 3 00 

ATCTCGGTGA GCTGCCTGTG GACGCAAGAT TTCCTCCTAG AGTGCCAAAA TCTTTTCCAT 3 60 

TCAACACCTC AGTCGTGTAC AAAAAGACTC TGTTTGTAGA ATTCACGGAT CACCTTTTCA 42 0 

ACATCGCTAA GCCAAGGCCA CCCTGCATcb GTCTGCTAGG TCCTACCATC CAGGCTGAGG 4S0 

TTTATGATAC AGTGGTCATT ACACTTAAGA ACATGGCTTC CCATCCTGTC AGTCTTCATG 540 

CTGTTGGTGT ATCCTACTGG AAAGCTTCTg\aGGGAGCTGA ATATGATGAT CAGACCAGTC 600 

AAAGGGAGAA AGAAGATGAT AAAGTCTTCC 'cTGGTGGAAG CCATACATAT GTCTGGCAGG 660 

TCCTGAAAGA GAATGGTCCA ATGGCCTCTG ACCCACTGTG CCTTACCTAC TCATATCTTT 72 0 

CTCATGTGGA CCTGGTAAAA GACTTGAATT cVgGCCTCAT TGGAGCCCTA CTAGTATGTA 780 

GAGAAGGGAG TCTGGCCAAG GAAAAGACAC AOACCTTGCA CAAATTTATA CTACTTTTTG H4 0 

CTGTATTTGA TGAAGGGAAA AGTTGGCACT CAGAAACAAA GAACTCCTTG ATGCAGGATA 900 

GGGATGCTGC ATCTGCTCGG GCCTGGCCTA AAATGCACAC AGTCAATGGT TATGTAAACA 960 

GGTCTCTGCC AGGTCTGATT GGATGCCACA GGaVaTCAGT CTATTGGCAT GTGATTGGAA 102 0 

TGGGCACCAC TCCTGAAGTG CACTCAATAT TCcicGAAGG TCACACATTT CTTGTGAGGA 1080 

ACCATCGCCA GGCGTCCTTG GAAATCTCGC CAAT^U^CTTT CCTTACTGCT CAAACACTCT 1140 

TGATGGACCT TGGACAGTTT CTACTGTTTT GTCAtATCTC TTCCCACCAA CATGATGGCA 12 00 

TGGAAGCTTA TGTCAAAGTA GACAGCTGTC CAGAGfeAACC CCAACTACGA ATGAAAAATA 12 60 

ATGAAGAAGC GGAAGACTAT GATGATGATC TTACTGATTC TGAAATGGAT GTGGTCAGGT 13 2 0 

TTGATGATGA CAACTCTCCT TCCTTTATCC AAATTOGCTC AGTTGCCAAG AAGCATCCTA 13 80 

AAACTTGGGT ACATTACATT GCTGCTGAAG AGGAGG^CTG GGACTATGCT CCCTTAGTCC 144 0 

TCGCCCCCGA TGACAGAAGT TATAAAAGTC AATATTTV3AA CAATGGCCCT CAGCGGATTG 1500 

GTAGGAAGTA CAAAAAAGTC CGATTTATGG CATACAC^GA TGAAACCTTT AAGACTCGTG 1560 

AAGCTATTCA GCATGAATCA GGAATCTTGG GACCTTT;k:T TTATGGGGAA GTTGGAGACA 162 0 

CACTGTTGAT TATATTTAAG AATCAAGCAA GCAGACCATA TAACATCTAC CCTCACGGAA 168 0 

TCACTGATGT CCGTCCTTTG TATTCAAGGA GATTACCA^ AGGTGTAAAA CATTTGAAGG 1740 

ATTTTCCAAT TCTGCCAGGA GAAATATTCA AATATAAATE GACAGTGACT GTAGAAGATG 1800 

GGCCAACTAA ATCAGATCCT CGGTGCCTGA CCCGCTATTA CTCTAGTTTC GTTAATATGG 1860 

AGAGAGATCT AGCTTCAGGA CTCATTGGCC CTCTCCTCAt\ CTGCTACAAA GAATCTGTAG 192 0 

ATCAAAGAGG AAACCAGATA ATGTCAGACA AGAGGAATGT \cATCCTGTTT TCTGTATTTG 1980 



ATGAGAACCG AAck:TGGTAC CTCACAGAGA ATATACAACG CTTTCTCCCC AATCCAGCTG 2 04 0 

GAGTGCAGCT TGAGGATCCA GAGTTCCAAG CCTCCAACAT CATGCACAGC ATCAATGGCT 2100 

ATGTTTTTGA TAGTTTGCAG TTGTCAGTTT GTTTGCATGA GGTGGCATAC TGGTACATTC 2160 

TAAGCATTGG AGCACAGACT GACTTCCTTT CTGTCTTCTT CTCTGGATAT ACCTTCAAAC 22 2 0 

ACAAAATGGT CTATGAAGAC ACACTCACCC TATTCCCATT CTCAGGAGAA ACTGTCTTCA 22 80 

TGTCGATGGA AAACCCAGGT CTATGGATTC TGGGGTGCCA CAACTCAGAC TTTCGGAACA 2 3 40 

GAGGCATGAC CGCCTTACTG AAGGTTTCTA GTTGTGACAA GAACACTGGT GATTATTACG 2 400 

AGGACAGTTA TGAAGATATT TCAGCATACT TGCTGAGTAA AAACAATGCC ATTGAACCAA 2 460 

GAAGCTTCTC CCAGAATTCA AGACACCCTA GCACTAGGCA AAAGCAATTT AATGCCACCA 2 52 0 

CAATTCCAGA AAATGACATA GAGAAGACTG ACCCTTGGTT TGCACACAGA ACACCTATGC 2 580 

CTAAAATACA AAATGTCTCC TCTAGTGATT TGTTGATGCT CTTGCGACAG AGTCCTACTC 2 640 

CACATGGGCT ATCCTTATCT GATCTCCAAG AAGCCAAATA TGAGACTTTT TCTGATGATC 2 7 00 

CATCACCTGG AGCAATAGAC AGTAATAACA GCCTGTCTGA AATGACACAC TTCAGGCCAC 2 7 60 

AGCTCCATCA CAGTGGGGAC ATGGTATTTA CCCCTGAGTC AGGCCTCCAA TTAAGATTAA 2 82 0 

ATGAGAAACT GGGGACAACT GCAGCAACAG AGTTGAAGAA ACTTGATTTC AAAGTTTCTA 2 880 

GTACATCAAA TAATCTGATT TCAACAATTC CATCAGACAA TTTGGCAGCA GGTACTGATA 2 94 0 

ATACAAGTTC CTTAGGACCC CCAAGTATGC CAGTTCATTA TGATAGTCAA TTAGATACCA 3 000 

CTCTATTTGG CAAAAAGTCA TCTCCCCTTA CTGAGTCTGG TGGACCTCTG AGCTTGAGTG 3 060 

AAGAAAATAA TGATTCAAAG TTGTTAGAAT CAGGTTTAAT GAATAGCCAA GAAAGTTCAT 312 0 

GGGGAAAAAA TGTATCGTCA ACAGAGAGTG GTAGGTTATT TAAAGGGAAA AGAGCTCATG 3180 

GACCTGCTTT GTTGACTAAA GATAATGCCT TATTCAAAGT TAGCATCTCT TTGTTAAAGA 3 2 40 

CAAACAAAAC TTCCAATAAT TCAGCAACTA ATAGAAAGAC TCACATTGAT GGCCCATCAT 3 3 00 

TATTAATTGA GAATAGTCCA TCAGTCTGGC AAAATATATT AGAAAGTGAC ACTGAGTTTA 3360 

AAAAAGTGAC ACCTTTGATT CATGACAGAA TGCTTATGGA CAAAAATGCT ACAGCTTTGA 3 42 0 

GGCTAAATCA TATGTCAAAT AAAACTACTT CATCAAAAAA CATGGAAATG GTCCAACAGA 3 480 

AAAAAGAGGG CCCCATTCCA CCAGATGCAC AAAATCCAGA TATGTCGTTC TTTAAGATGC 3 540 

TATTCTTGCC AGAATCAGCA AGGTGGATAC AAAGGACTCA TGGAAAGAAC TCTCTGAACT 3 600 

CTGGGCAAGG CCCCAGTCCA AAGCAATTAG TATCCTTAGG ACCAGAAAAA TCTGTGGAAG 3 660 

GTCAGAATTT CTTGTCTGAG AAAAACAAAG TGGTAGTAGG AAAGGGTGAA TTTACAAAGG 3 72 0 

ACGTAGGACT CAAAGAGATG GTTTTTCCAA GCAGCAGAAA CCTATTTCTT ACTAACTTGG 3 7 80 

ATAATTTACA TGAAAATAAT ACACACAATC AAGAAAAAAA AATTCAGGAA GAAATAGAAA 3 840 

AGAAGGAAAC ATTAATCCAA GAGAATGTAG TTTTGCCTCA GATACATACA GTGACTGGCA 3 900 

CTAAGAATTT CATGAAGAAC CTTTTCTTAC TGAGCACTAG GCAAAATGTA GAAGGTTCAT 3 960 

ATGACGGGGC ATATGCTCCA GTACTTCAAG ATTTTAGGTC ATTAAATGAT TCAACAAATA 4 02 0 

GAACAAAGAA ACACACAGCT CATTTCTCAA AAAAAGGGGA GGAAGAAAAC TTGGAAGGCT 4080 

. ^ TGGGAAATCA AACCAAGCAA ATTGTAGAGA AATATGCATG CACCACAAGG ATATCTCCTA 4140 

yjJ ATACAAGCCA GCAGAATTTT GTCACGCAAC GTAGTAAGAG AGCTTTGAAA CAATTCAGAC 4200 

TCCCACTAGA AGAAACAGAA CTTGAAAAAA GGATAATTGT GGATGACACC TCAACCCAGT 42 6 0 

■^C)^ GGTCCAAAAA CATGAAACAT TTGACCCCGA GCACCCTCAC ACAGATAGAC TACAATGAGA 43 2 0 

AGGAGAAAGG GGCCATTACT CAGTCTCCCT TATCAGATTG CCTTACGAGG AGTCATAGCA 43 SO 

Ij TCCCTCAAGC AAATAGATCT CCATTACCCA TTGCAAAGGT ATCATCATTT CCATCTATTA 444 0 

GACCTATATA TCTGACCAGG GTCCTATTCC AAGACAACTC TTCTCATCTT CCAGCAGCAT 4500 

CTTATAGAAA GAAAGATTCT GGGGTCCAAG AAAGCAGTCA TTTCTTACAA GGAGCCAAAA 4560 

AAAATAACCT TTCTTTAGCC ATTCTAACCT TGGAGATGAC TGGTGATCAA AGAGAGGTTG 4 62 0 

GCTCCCTGGG GACAAGTGCC ACAAATTCAG TCACATACAA GAAAGTTGAG AACACTGTTC 46 80 

TCCCGAAACC AGACTTGCCC AAAACATCTG GCAAAGTTGA ATTGCTTCCA AAAGTTCACA 47 40 

TTTATCAGAA GGACCTATTC CCTACGGAAA CTAGCAATGG GTCTCCTGGC CATCTGGATC 4800 

TCGTGGAAGG GAGCCTTCTT CAGGGAACAG AGGGAGCGAT TAAGTGGAAT GAAGCAAACA 4860 

GACCTGGAAA AGTTCCCTTT CTGAGAGTAG CAACAGAAAG CTCTGCAAAG ACTCCCTCCA 492 0 

AGCTATTGGA TCCTCTTGCT TGGGATAACC ACTATGGTAC TCAGATACCA AAAGAAGAGT 4980 

GGAAATCCCA AGAGAAGTCA CCAGAAAAAA CAGCTTTTAA GAAAAAGGAT ACCATTTTGT 5040 

CCCTGAACGC TTGTGAAAGC AATCATGCAA TAGCAGCAAT AAATGAGGGA CAAAATAAGC 510 0 

CCGAAATAGA AGTCACCTGG GCAAAGCAAG GTAGGACTGA AAGGCTGTGC TCTCAAAACC 5160 

CACCAGTCTT GAAACGCCAT CAACGGGAAA TAACTCGTAC TACTCTTCAG TCAGATCAAG 52 2 0 

AGGAAATTGA CTATGATGAT ACCATATCAG TTGAAATGAA GAAGGAAGAT TTTGACATTT 5280 

ATGATGAGGA TGAAAATCAG AGCCCCCGCA GCTTTCAAAA GAAAACACGA CACTATTTTA 53 4 0 

TTGCTGCAGT GGAGAGGCTC TGGGATTATG GGATGAGTAG CTCCCCACAT GTTCTAAGAA 5400 



ACAGGGCTCA GAGTGGCAGT GTCCCTCAGT TCAAGAAAGT TGTTTTCCAG GAATTTACTG 5460 

ATGGCTCCTT TACTCAGCCC TTATACCGTG GAGAACTAAA TGAACATTTG GGACTCCTGG 552 0 

GGCCATATAT AAGAGCAGAA GTTGAAGATA ATATCATGGT AACTTTCAGA AATCAGGCCT 5580 

CTCGTCCCTA TTCCTTCTAT TCTAGCCTTA TTTCTTATGA GGAAGATCAG AGGCAAGGAG 5640 

CAGAACCTAG AAAAAACTTT GTCAAGCCTA ATGAAACCAA AACTTACTTT TGGAAAGTGC 57 00 

AACATCATAT GGCACCCACT AAAGATGAGT TTGACTGCAA AGCCTGGGCT TATTTCTCTG 57 60 

ATGTTGACCT GGAAAAAGAT GTGCACTCAG GCCTGATTGG ACCCCTTCTG GTCTGCCACA 582 0 

CTAACACACT GAACCCTGCT CATGGGAGAC AAGTGACAGT ACAGGAATTT GCTCTGTTTT 5880 

TCACCATCTT TGATGAGACC AAAAGCTGGT ACTTCACTGA AAATATGGAA AGAAACTGCA 5 94 0 

GGGCTCCCTG CAATATCCAG ATGGAAGATC CCACTTTTAA AGAGAATTAT CGCTTCCATG 6000 

CAATCAATGG CTACATAATG GATACACTAC CTGGCTTAGT AATGGCTCAG GATCAAAGGA 6060 

TTCGATGGTA TCTGCTCAGC ATGGGCAGCA ATGAAAACAT CCATTCTATT CATTTCAGTG 612 0 

GACATGTGTT CACTGTACGA AAAAAAGAGG AGTATAAAAT GGCACTGTAC AATCTCTATC 6180 

CAGGTGTTTT TGAGACAGTG GAAATGTTAC CATCCAAAGC TGGAATTTGG CGGGTGGAAT 6240 

GCCTTATTGG CGAGCATCTA CATGCTGGGA TGAGCACACT TTTTCTGGTG TACAGCAATA 63 00 

AGTGTCAGAC TCCCCTGGGA ATGGCTTCTG GACACATTAG AGATTTTCAG ATTACAGCTT 63 60 

CAGGACAATA TGGACAGTGG GCCCCAAAGC TGGCCAGACT TCATTATTCC GGATCAATCA 6420 

ATGCCTGGAG CACCAAGGAG CCCTTTTCTT GGATCAAGGT GGATCTGTTG GCACCAATGA 64 RO 

TTATTCACGG CATCAAGACC CAGGGTGCCC GTCAGAAGTT CTCCAGCCTC TACATCTCTC 6 54 0 

AGTTTATCAT CATGTATAGT CTTGATGGGA AGAAGTGGCA GACTTATCGA GGAAATTCCA 6 600 

CTGGAACCTT AATGGTCTTC TTTGGCAATG TGGATTCATC TGGGATAAAA CACAATATTT 6 660 

TTAACCCTCC AATTATTGCT CGATACATCC GTTTGCACCC AACTCATTAT AGCATTCGCA 67 2 0 

GCACTCTTCG CATGGAGTTG ATGGGCTGTG ATTTAAATAG TTGCAGCATG CCATTGGGAA 678 0 

TGGAGAGTAA AGCAATATCA GATGCACAGA TTACTGCTTC ATCCTACTTT ACCAATATGT 6840 

TTGCCACCTG GTCTCCTTCA AAAGCTCGAC TTCACCTCCA AGGGAGGAGT AATGCCTGGA 690 0 

GACCTCAGGT GAATAATCCA AAAGAGTGGC TGCAAGTGGA CTTCCAGAAG ACAATGAAAG 6960 

TCACAGGAGT AACTACTCAG GGAGTAAAAT CTCTGCTTAC CAGCATGTAT GTGAAGGAGT 7U20 

TCCTCATCTC CAGCAGTCAA GATGGCCATC AGTGGACTCT CTTTTTTCAG AATGGCAAAG 7 080 

TAAAGGTTTT TCAGGGAAAT CAAGACTCCT TCACACCTGT GGTGAACTCT CTAGACCCAC 7140 

CGTTACTGAC TCGCTACCTT CGAATTCACC CCCAGAGTTG GGTGCACCAG ATTGCCCTGA 7200 

GGATGGAGGT TCTGGGCTGC GAGGCACAGG ACCTCTACTG AGGGTGGCCA CTGCAGCACC 72 60 

TGCCACTGCC GTCACCTCTC CCTCCTCAGC TCCAGGGCAG TGTCCCTCCC TGGCTTGCCT 7320 

TCTACCTTTG TGCTAAATCC TAGCAGACAC TGCCTTGAAG CCTCCTGAAT TAACTATCAT 73 80 

CAGTCCTGCA TTTCTTTGGT GGGGGGCCAG GAGGGTGCAT CCAATTTAAC TTAACTCTTA 7 4 40 

CCTATTTTCT GCAGCTGCTC CCAGATTACT CCTTCCTTCC AATATAACTA GGCAAAAAGA 7 5 00 

AGTGAGGAGA AACCTGCATG AAAGCATTCT TCCCTGAAAA GTTAGGCCTC TCAGAGTCAC 7 560 

CACTTCCTCT GTTGTAGAAA AACTATGTGA TGAAACTTTG AAAAAGATAT TTATGATGTT 7 620 

AACATTTCAG GTTAAGCCTC ATACGTTTAA AATAAAACTC TCAGTTGTTT ATTATCCTGA 7 680 

TCAAGCATGG AACAAAGCAT GTTTCAGGAT CAGATCAATA CAATCTTGGA GTCAAAAGGC 7 7 40 

AAATCATTTG GACAATCTGC AAAATGGAGA GAATACAATA ACTACTACAG TAAAGTCTGT 7 8 00 

TTCTGCTTCC TTACACATAG ATATAATTAT GTTATTTAGT CATTATGAGG GGCACATTCT 7 860 

TATCTCCAAA ACTAGCATTC TTAAACTGAG AATTATAGAT GGGGTTCAAG AATCCCTAAG 7 920 

TCCCCTGAAA TTATATAAGG CATTCTGTAT AAATGCAAAT GTGCATTTTT CTGACGAGTG 7980 

TCCATAGATA TAAAGCCATT TGGTCTTAAT TCTGACCAAT AAAAAAATAA GTCAGGAGGA 804 0 

TGCAATTGTT GAAAGCTTTG AAATAAAATA ACAATGTCTT CTTGAAATTT GTGATGGCCA 8100 

AGAAAGAAAA TGATGATGAC ATTAGGCTTC TAAAGGACAT ACATTTAATA TTTCTGTGGA 8160 

AATATGAGGA AAATCCATGG TTATCTGAGA TAGGAGATAC AAACTTTGTA ATTCTAATAA 8220 

TGCACTCAGT TTACTCTCTC CCTCTACTAA TTTCCTGCTG AAAATAACAC AACAAAAATG 82 80 

TAACAGGGGA AATTATATAC CGTGACTGAA AACTAGAGTC CTACTTACAT AGTTGAAATA 8340 

TCAAGGAGGT CAGAAGAAAA TTGGACTGGT GAAAACAGAA AAAACACTCC AGTCTGCCAT 8400 

ATCACCACAC AATAGGATCC CCCTTCTTGC CCTCCACCCC CATAAGATTG TGAAGGGTTT 84 60 

ACTGCTCCTT CCATCTGCCT GACCCCTTCA CTATGACTAC ACAGAATCTC CTGATAGTAA 8520 

AGGGGGCTGG AGGCAAGGAT AAGTTATAGA GCAGTTGGAG GAAGCATCCA AAGATTGCAA 8580 

CCCAGGGCAA ATGGAAAACA GGAGATCCTA ATATGAAAGA AAAATGGATC CCAATCTGAG 864 0 

AAAAGGCAAA AGAATGGCTA CTTTTTTCTA TGCTGGAGTA TTTTCTAATA ATCCTGCTTG 87 0 0 

ACCCTTATCT GACCTCTTTG GAAACTATAA CATAGCTGTC ACAGTATAGT CACAATCCAC 87 60 

AAATGATGCA GGTGCAAATG GTTTATAGCC CTGTGAAGTT CTTAAAGTTT AGAGGCTAAC 8 820 



TTACAGAAAT GAATAAGTTG "tTTTTCTTTTA TAGCCCGGTA GAGGAGTTAA CCCCAAAGGT 8880 
GATATGGTTT TATTTCCTGT TATGTTTAAC TTAATAATCT TATTTTGGCA TTCTTTTCCC 8940 
ATTGACTATA TACATCTCTA TTTTCTCAAAT GTTCATGGAA CTAGCTCTTT TATTTTCCTG 9000 
CTGGTTTCTT CAGTAATGAG TTAAATAAAA CATTGACACA TACAAAAAAA AAAAAAAAAA 9 060 

AAAAAAAAAA AAAAAAAAAA \ 9080 

\ 

(2) INFORMATION FOR SEQ NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2351 amino acids 

(B) TYPE: amino Wcid 

(C) STRANDEDNESs\ single 

(D) TOPOLOGY: liripar 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTIci^ : SEQ ID NO: 45: 

Met Gin lie Glu Leu Ser \,Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1 5 \ 10 15 



Cys Phe Ser Ala Thr Arg Atg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 \ 25 30 



Trp Asp Tyr Met Gin Ser Asb Leu Gly Glu Leu Pro Val Asp Ala Arg 
35 \ 40 45 



Phe Pro Pro Arg Val Pro Lys 'ySer Phe Pro Phe Asn Thr Ser Val Val 
50 55 \ 60 

Tyr Lys Lys Thr Leu Phe Val g\Lu Phe Thr Asp His Leu Phe Asn He 

65 7 0 \ 7 5 80 

\ 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr He Gin 
85 \ 90 95 

Ala Glu Val Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser 
100 l\05 110 

His Pro Val Ser Leu His Ala Val GlV Val Ser Tyr Trp Lys Ala Ser 
115 120 \ 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr \ Ser Gin Arg Glu Lys Glu Asp 
130 135 \ 140 

Asp Lys Val Phe Pro Gly Gly Ser His -^r Tyr Val Trp Gin Val Leu 
145 150 \ 155 160 

Lys Glu Asn Gly Pro Met Ala Ser Asp PrV) Leu Cys Leu Thr Tyr Ser 
165 170 175 

Tyr Leu Ser His Val Asp Leu Val Lys AsplLeu Asn Ser Gly Leu He 
180 185 \ 190 

Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 \ 205 



Gin Thr Leu His Lys Phe He Leu Leu Phe Ala Val Phe Asp Glu Gly 
210 215 220 



Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 \ 230 235 240 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 
245 250 255 

Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val 
260 265 270 

Tyr Trp His Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser He 
275 280 285 

Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 
290 295 300 

Leu Glu lie Ser Pro He Thr Phe Leu Thr Ala Gin Thr Leu Leu Met 
305 310 315 320 

Asp Leu Gly Gin Phe Leu Leu Phe Cys His He Ser Ser His Gin His 
325 330 335 

Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
340 345 350 

Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 

Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 380 

Pro Ser Phe He Gin He Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 400 

Trp Val His Tyr He Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 
405 410 415 

Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 
420 425 430 

Asn Gly Pro Gin Arg He Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
435 440 445 

Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala He Gin His Glu 
450 455 460 

Ser Gly He Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 

Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn He Tyr Pro 
485 490 495 

His Gly He Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
500 505 510 



Gly Val Lys His Leu Lys Asp Phe Pro He Leu Pro Gly Glu He Phe 
515 520 525 



Lys Tyr Lys "ffrp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 

530 \ 535 540 

Pro Arg Cys Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 

545 \ 550 555 560 

Asp Leu Ala Ser^Gly Leu lie Gly Pro Leu Leu lie Cys Tyr Lys Glu 

\565 570 575 



Ser Val Asp Gin Atrg Gly Asn Gin lie Met Ser Asp Lys Arg Asn Val 

580 \^ 585 590 

\ 

lie Leu Phe Ser Va:\ Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 \ 600 605 



Asn lie Gin Arg Phe Veu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 

610 '\ 615 620 



Pro Glu Phe Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val 

625 6i,0 635 640 

\ 

Phe Asp Ser Leu Gin Leu''^ Ser Val Cys Leu His Glu Val Ala Tyr Trp 
645 \ 650 655 

\ 

Tyr lie Leu Ser lie Gly AXa Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 



Ser Gly Tyr Thr Phe Lys Hi^ Lys Met Val Tyr Glu Asp Thr Leu Thr 



675 



680 



685 



Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 

690 695 \ 700 

Gly Leu Trp lie Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705 710 y 715 720 

\ 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 



725 



730 



735 



Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala lie Glu Pro Arg Ser Phe\ser Gin Asn Ser Arg His Pro 

755 760 \ 765 

Ser Thr Arg Gin Lys Gin Phe Asn Ala T^ir Thr lie Pro Glu Asn Asp 
770 775 \ 780 

lie Glu Lys Thr Asp Pro Trp Phe Ala His\ Arg Thr Pro Met Pro Lys 
785 790 \795 800 

lie Gin Asn Val Ser Ser Ser Asp Leu Leu I^et Leu Leu Arg Gin Ser 
805 810 \ 815 



Pro Thr Pro His Gly Leu Ser Leu Ser Asp LeV Gin Glu Ala Lys Tyr 
820 825 830 



Glu Thr Pha Ser Asp Asp Pro Ser Pro Gly Ala lie Asp Ser Asn Asn 
835\ 840 845 

Ser Leu Ser Glu Met Thr His Phe Arg Pro Gin Leu His His Ser Gly 
850 855 860 

Asp Met Val Phe Thr Pro Glu Ser Gly Leu Gin Leu Arg Leu Asn Glu 
865 870 875 880 

Lys Leu Gly Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys 
385 890 895 

Val Ser Ser Thr Ser Asn Asn Leu lie Ser Thr lie Pro Ser Asp Asn 
900 905 910 

Leu Ala Ala Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met 
915 ' 920 925 

Pro Val His Tyr Asp Ser Gin Leu Asp Thr Thr Leu Phe Gly Lys Lys 
930 935 940 

Ser Ser Pro Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu 
945 950 955 960 

Asn Asn Asp Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gin Glu 
965 970 975 

Ser Ser Trp Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe 
980 985 990 

Lys Gly Lys Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala 
995 1000 1005 

Leu Phe Lys Val Ser lie Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn 
1010 1015 1020 

Asn Ser Ala Thr Asn Arg Lys Thr His lie Asp Gly Pro Ser Leu Leu 
1025 1030 1035 1040 

lie Glu Asn Ser Pro Ser Val Trp Gin Asn lie Leu Glu Ser Asp Thr 
1045 1050 1055 

Glu Phe Lys Lys Val Thr Pro Leu lie His Asp Arg Met Leu Met Asp 
1060 1065 1070 

Lys Asn Ala Thr Ala Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr 
1075 1080 1085 

Ser Ser Lys Asn Met Glu Met Val Gin Gin. Lys Lys Glu Gly Pro lie 
1090 1095 1100 

Pro Pro Asp Ala Gin Asn Pro Asp Met Ser Phe Phe Lys Met Leu Phe 
1105 1110 13^15 1120 



Leu Pro Glu Ser Ala Arg Trp lie Gin Arg Thr His Gly Lys Asn Ser 
1125 1130 1135 



Leu Asn SerXcly Gin Gly Pro Ser Pro Lys Gin Leu Val Ser Leu Gly 
Vl40 1145 1150 



Pro Glu Lys S^r Val Glu Gly Gin Asn Phe Leu Ser Glu Lys Asn Lys 
1155 \ 1160 1165 

Val Val Val Gly -Lys Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu 
1170 1175 1180 

Met Val Phe Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn 
1185 \ 1190 1195 1200 

>. 

Leu His Glu Asn Asn Thr His Asn Gin Glu Lys Lys lie Gin Glu Glu 
1205\ 1210 1215 

lie Glu Lys Lys Glu Thr Leu lie Gin Glu Asn Val Val Leu Pro Gin 
1220 ■ 1225 1230 

lie His Thr Val Thr Gly 'Thr Lys Asn Phe Met Lys Asn Leu Phe Leu 
1235 1240 1245 

Leu Ser Thr Arg Gin Asn Val Glu Gly Ser Tyr Asp Gly Ala Tyr Ala 
1250 12^5 1260 

Pro Val Leu Gin Asp Phe Arg, Ser Leu Asn Asp Ser Thr Asn Arg Thr 
1265 1270 \ 1275 1280 

Lys Lys His Thr Ala His Phe ^er Lys Lys Gly Glu Glu Glu Asn Leu 
1285 \ 1290 1295 

Glu Gly Leu Gly Asn Gin Thr Ly^ Gin lie Val Glu Lys Tyr Ala Cys 
13 00 -1305 1310 

Thr Thr Arg lie Ser Pro Asn Thr Ser Gin Gin Asn Phe Val Thr Gin 
1315 1320 \ 1325 

Arg Ser Lys Arg Ala Leu Lys Gin Ph^ Arg Leu Pro Leu Glu Glu Thr 

1330 1335 1340 

\ 

Glu Leu Glu Lys Arg lie lie Val Asp ksp Thr Ser Thr Gin Trp Ser 
1345 1350 \ 1355 1360 

Lys Asn Met Lys His Leu Thr Pro Ser TtiT Leu Thr Gin lie Asp Tyr 
1365 13i0 1375 

Asn Glu Lys Glu Lys Gly Ala lie Thr Gin \Ser Pro Leu Ser Asp Cys 

1380 1385 \ 1390 

\ 

Leu Thr Arg Ser His Ser lie Pro Gin Ala A^n Arg Ser Pro Leu Pro 
1395 1400 \ 1405 

lie Ala Lys Val Ser Ser Phe Pro Ser lie Arg\ Pro lie Tyr Leu Thr 
1410 1415 \l420 

\ 

Arg Val Leu Phe Gin Asp Asn Ser Ser His Leu E^oro Ala Ala Ser Tyr 
1425 1430 1435 \ 1440 



\ 



Arg Lys Lys Asp^Ser Gly Val Gin Glu Ser Ser His Phe Leu Gin Gly 
1445 1450 1455 

Ala Lys Lys Asn Asn Leu Ser Leu Ala lie Leu Thr Leu Glu Met Thr 
1460 1465 1470 

Gly Asp Gin Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser 
1475 1480 1485 

Val Thr Tyr Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu 
1490 1495 1500 

Pro Lys Thr Ser Gly Lys Val Glu Leu Leu Pro Lys Val His lie Tyr 
1505 1510 1515 1520 

Gin Lys Asp Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His 
1525 1530 1535 

Leu Asp Leu Val Glu Gly Ser Leu Leu Gin Gly Thr Glu Gly Ala lie 
1540 1545 1550 

Lys Trp Asn Glu Ala Asn Arg Pro Gly Lys Val Pro Phe Leu Arg Val 
1555 1560 1565 

Ala Thr Glu Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu 
1570 1575 1580 

Ala Trp Asp Asn His Tyr Gly Thr Gin lie Pro Lys Glu Glu Trp Lys 
1585 1590 1595 1600 

Ser Gin Glu Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr 
1605 1610 1615 

He Leu Ser Leu Asn Ala Cys Glu Ser Asn His Ala He Ala Ala He 
1520 1625 1630 

Asn Glu Gly Gin Asn Lys Pro Glu He Glu Val Thr Trp Ala Lys Gin 
1635 1640 1645 

Gly Arg Thr Glu Arg Leu Cys Ser Gin Asn Pro Pro Val Leu Lys Arg 
1650 1655 1660 

His Gin Arg Glu He Thr Arg Thr Thr Leu Gin Ser Asp Gin Glu Glu 
1665 1670 1675 1680 

He Asp Tyr Asp Asp Thr He Ser Val Glu Met Lys Lys Glu Asp Phe 
1685 1690 1695 

Asp He Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys 
1700 1705 1710 

Lys Thr Arg His Tyr Phe He Ala Ala Val Glu Arg Leu Trp Asp Tyr 
1715 1720 1725 



Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin Ser Gly 
1730 1735 1740 



Ser Val Pro OLln Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly 
1745 \ 1750 1755 1760 

Ser Phe Thr Gli\ Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly 
1765 1770 1775 

Leu Leu Gly Pro Vyr lie Arg Ala Glu Val Glu Asp Asn lie Met Val 

1780 \^ 1785 1790 

Thr Phe Arg Asn 01^1 Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu 

1795 \ 1800 1805 

lie Ser Tyr Glu G1u\asp Gin Arg Gin Gly Ala Glu Pro Arg Lys Asn 



1810 



\ 



1815 



1820 



Phe Val Lys Pro Asn (ilu Thr Lys Thr Tyr Phe Trp Lys Val Gin His 
1825 1^30 1835 1840 

\ 

His Met Ala Pro Thr Ly^ Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr 
1845 \ 1850 1855 

\ 

Phe Ser Asp Val Asp Leu ^lu Lys Asp Val His Ser Gly Leu lie Gly 
1860 ' 1865 1870 

Pro Leu Leu Val Cys His Tht Asn Thr Leu Asn Pro Ala His Gly Arg 
1875 \l880 1885 

Gin Val Thr Val Gin Glu Phe !kla Leu Phe Phe Thr He Phe Asp Glu 



1890 



1895 



1900 



Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala 
1905 1910 \ 1915 1920 



Pro Cys Asn He Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg 
1925 \ 1930 1935 



Phe His Ala He Asn Gly Tyr He Md^t Asp Thr Leu Pro Gly Leu Val 
1940 19i|5 1950 

Met Ala Gin Asp Gin Arg He Arg Trp\Tyr Leu Leu Ser Met Gly Ser 
1955 1960 \ 1965 

\ 

Asn Glu Asn He His Ser He His Phe ^r Gly His Val Phe Thr Val 
1970 1975 \ 1980 

Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu, Tyr Asn Leu Tyr Pro Gly 
1985 1990 1995 2000 

Val Phe Glu Thr Val Glu Met Leu Pro Ser l^ys Ala Gly He Trp Arg 
2005 2010 2015 

Val Glu Cys Leu He Gly Glu His Leu His Ala Gly Met Ser Thr Leu 
2020 2025 2030 



Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met Ala Ser 
2035 2040 2045 



Gly His lie Arg Vsp Phe Gin lie Thr Ala Ser Gly Gin Tyr Gly Gin 
2050 \ 2055 2060 

Trp Ala Pro Lys Lau Ala Arg Leu His Tyr Ser Gly Ser lie Asn Ala 
2065 \ 2070 2075 2080 

Trp Ser Thr Lys Glu\ Pro Phe Ser Trp lie Lys Val Asp Leu Leu Ala 
2085 2090 2095 

Pro Met lie lie His \ly lie Lys Thr Gin Gly Ala Arg Gin Lys Phe 
2100 \ 2105 2110 

Ser Ser Leu Tyr lie sir Gin Phe lie lie Met Tyr Ser Leu Asp Gly 
2115 \ 2120 2125 

Lys Lys Trp Gin Thr Tyii Arg Gly Asn Ser Thr Gly Thr Leu Met Val 
2130 \ 2135 2140 



Phe Phe Gly Asn Val Asp \Ser Ser Gly lie Lys His Asn lie Phe Asn 



2145 



ASp li: 

2150\ 



2155 



2160 



Pro Pro lie lie Ala Arg TVr lie Arg Leu His Pro Thr His Tyr Ser 
2165 \ 2170 2175 



lie Arg Ser Thr Leu Arg Met^ Glu Leu Met Gly Gys Asp Leu Asn Ser 
2180 \ 2185 2190 



Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala lie Ser Asp Ala Gin 
2195 2200 2205 



lie Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro 
2210 2215 \ 2220 



Ser Lys Ala Arg Leu His Leu Gin \Gly Arg Ser Asn Ala Trp Arg Pro 
2225 2230 \ 2235 2240 

Gin Val Asn Asn Pro Lys Glu Trp L^u Gin Val Asp Phe Gin Lys Thr 
2245 \ 2250 2255 

Met Lys Val Thr Gly Val Thr Thr Gl^ Gly Val Lys Ser Leu Leu Thr 
2260 2205 2270 



Ser Met Tyr Val Lys Glu Phe Leu lie \ser Ser Ser Gin Asp Gly His 



2275 



2280 



Gin Trp Thr Leu Phe Phe Gin Asn Gly LVs Val Lys Val Phe Gin Gly 
2290 2295 \ 2300 



Asn Gin Asp Ser Phe Thr Pro Val Val AsA Ser Leu Asp Pro Pro Leu 
2305 2310 \23i5 2320 

Leu Thr Arg Tyr Leu Arg lie His Pro Gln\ser Trp Val His Gin lie 
2325 233A 2335 



Ala Leu Arg Met Glu Val Leu Gly Cys Glu 
2340 2345 



la Gin Asp 



Leu Tyr 
2350 



(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 

(B) TYPE :\ nucleic acid 

(C) STRANQEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYffE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CTCGAGCTAA AGATATTTTa\gAGAAGAATT AACCTTTTGC TTCTCCA(3TT GAACATTTGT 60 

AGCAATAAGT CATGCAAATA GAGCTCTCCA CCTGCTTCTT TCTGTGCCTT TTGCGATTCT 12 0 

GCTTTAGTGC CACCAGAAGA TACTACCTGG GTGCAGTGGA ACTGTCATGG GACTATATGC 18 0 

AAAGTGATCT CGGTGAGCTG QCTGTGGACG CAAGATTTCC TCCTAGAGTG CCAAAATCTT 240 

TTCCATTCAA CACCTCAGTC GTGTACAAAA AGACTCTGTT TGTAGAATTC ACGGATCACC 3 00 

TTTTCAACAT CGCTAAGCCA AGGCCACCCT GGATGGGTCT GCTAGGTCCT ACCATCCAGG 3 60 

CTGAGGTTTA TGATACAGTG CTCATTACAC TTAAGAACAT GGCTTCCCAT CCTGTCAGTC 42 0 

TTCATGCTGT TGGTGTATCC TA(^TGGAAAG CTTCTGAGGG AGCTGAATAT GATGATCAGA 480 

CCAGTCAAAG GGAGAAAGAA GATCATAAAG TCTTCCCTGG TGGAAGCCAT ACATATGTCT 54 0 

GGCAGGTCCT GAAAGAGAAT GGT&CAATGG CCTCTGACCC ACTGTGCCTT ACCTACTCAT 600 

ATCTTTCTCA TGTGGACCTG GTAA^AGACT TGAATTCAGG CCTCATTGGA GCCCTACTAG 660 

TATGTAGAGA AGGGAGTCTG (^CCaAgGAAA AGACACAGAC CTTGCACAAA TTTATACTAC 72 0 

TTTTTGCTGT ATTTGATGAA GGGAAAAGTT GGCACTCAGA AACAAAGAAC TCCTTGATGC 7 80 

AGGATAGGGA TGCTGCATCT GCTCGg\5CCT GGCCTAAAAT GCACACAGTC AATGGTTATG 840 

TAAACAGGTC TCTGCCAGGT CTGATTCiGAT GCCACAGGAA ATCAGTCTAT TGGCATGTGA 9 00 

TTGGAATGGG CACCACTCCT GAAGTGO^CT CAATATTCCT CGAAGGTCAC ACATTTCTTG 9 60 

TGAGGAACCA TCGCCAGGCG TCCTTGgAaA TCTCGCCAAT AACTTTCCTT ACTGCTCAAA 102 0 

CACTCTTGAT GGACCTTGGA CAGTTTCTkc TGTTTTGTCA TATCTCTTCC CACCAACATG 1080 

ATGGCATGGA AGCTTATGTC AAAGTAGAOA GCTGTCCAGA GGAACCCCAA CTACGAATGA 114 0 

AAAATAATGA AGAAGCGGAA GACTATGATG ATGATCTTAC TGATTCTGAA ATGGATGTGG 12 uo 

TCAGGTTTGA TGATGACAAC TCTCCTTCci TTATCCAAAT TCGCTCAGTT GCCAAGAAGC 12 60 

ATCCTAAAAC TTGGGTACAT TACATTGCTg\ CTGAAGAGGA GGACTGGGAC TATGCTCCCT 132 0 

TAGTCCTCGC CCCCGATGAC AGAAGTTATA WvAGTCAATA TTTGAACAAT GGCCCTCAGC 13 80 

GGATTGGTAG GAAGTACAAA AAAGTCCGAT TTATGGCATA CACAGATGAA ACCTTTAAGA 144 0 

CTCGTGAAGC TATTCAGCAT GAATCAGGAA T^TTGGGACC TTTACTTTAT GGGGAAGTTG 1500 

GAGACACACT GTTGATTATA TTTAAGAATC AAGCAAGCAG ACCATATAAC ATCTACCCTC 1560 

ACGGAATCAC TGATGTCCGT CCTTTGTATT C;^GGAGATT ACCAAAAGGT GTAAAACATT 162 0 

TGAAGGATTT TCCAATTCTG CCAGGAGAAA TATTCAAATA TAAATGGACA GTGACTGTAG 1680 

AAGATGGGCC AACTAAATCA GATCCTCGGT GCcVgACCCG CTATTACTCT AGTTTCGTTA 17 40 

ATATGGAGAG AGATCTAGCT TCAGGACTCA TTGQCCCTCT CCTCATCTGC TACAAAGAAT 1800 

CTGTAGATCA AAGAGGAAAC CAGATAATGT CAGACAAGAG GAATGTCATC CTGTTTTCTG 1860 

TATTTGATGA GAACCGAAGC TGGTACCTCA CAGAcWtAT ACAACGCTTT CTCCCCAATC 1920 

CAGCTGGAGT GCAGCTTGAG GATCCAGAGT TCCAAGCCTC CAACATCATG CACAGCATCA 198 0 

ATGGCTATGT TTTTGATAGT TTGCAGTTGT CAGTTTIGTTT GCATGAGGTG GCATACTGGT 2 04 0 

ACATTCTAAG CATTGGAGCA CAGACTGACT TCCTTTCTGT CTTCTTCTCT GGATATACCT 2100 

TCAAACACAA AATGGTCTAT GAAGACACAC TCACCCTATT CCCATTCTCA GGAGAAACTG 2160 

TCTTCATGTC GATGGAAAAC CCAGGTCTAT GGATTCTffiGG GTGCCACAAC TCAGACTTTC 22 2 0 

GGAACAGAGG CATGACCGCC TTACTGAAGG TTTCTAGTTG TGACAAGAAC ACTGGTGATT 22 80 

ATTACGAGGA CAGTTATGAA GATATTTCAG CATACTTGffT GAGTAAAAAC AATGCCATTG 23 40 

AACCAAGAAG CTTCTCCCAG AACCCACCAG TCTTGAAACG CCATCAACGG GAAATAACTC 2400 

GTACTACTCT TCAGTCAGAT CAAGAGGAAA TTGACTATGA TGATACCATA TCAGTTGAAA 24 60 

TGAAGAAGGA AGATTTTGAC ATTTATGATG AGGATGAAAa\ TCAGAGCCCC CGCAGCTTTC 2520 

AAAAGAAAAC ACGACACTAT TTTATTGCTG CAGTGGAGAG \gCTCTGGGAT TATGGGATGA 2 580 

GTAGCTCCCC ACATGTTCTA AGAAACAGGG CTCAGAGTGG CAGTGTCCCT CAGTTCAAGA 2 64 0 

AAGTTGTTTT CCAGGAATTT ACTGATGGCT CCTTTACTCA QCCCTTATAC CGTGGAGAAC 27 00 

TAAATGAACA TTTGGGACTC CTGGGGCCAT ATATAAGAGC AGAAGTTGAA GATAATATCA 2 7 60 

TGGTAACTTT CAGAAATCAG GCCTCTCGTC CCTATTCCTT CTLATTCTAGC CTTATTTCTT 2 82 0 

ATGAGGAAGA TCAGAGGCAA GGAGCAGAAC CTAGAAAAAA CtWgTCAAG CCTAATGAAA 2 88 0 

CCAAAACTTA CTTTTGGAAA GTGCAACATC ATATGGCACC CAGTAAAGAT GAGTTTGACT 2 94 0 



GCAAAGCCTG GGCTTATT-ilC TCTGATGTTG ACCTGGAAAA AGATGTGCAC TCAGGCCTGA 3 000 

TTGGACCCCT TCTGGTCTGff CACACTAACA CACTGAACCC TGCTCATGGG AGACAAGTGA 3 060 

CAGTACAGGA ATTTGCTCTG, TTTTTCACCA TCTTTGATGA GACCAAAAGC TGGTACTTCA 312 0 

CTGAAAATAT GGAAAGAAAc\ TGCAGGGCTC CCTGCAATAT CCAGATGGAA GATCCCACTT 3180 

TTAAAGAGAA TTATCGCTTC \cATGCAATCA ATGGCTACAT AATGGATACA CTACCTGGCT 3240 

TAGTAATGGC TCAGGATCAA icGATTCGAT GGTATCTGCT CAGCATGGGC AGCAATGAAA 33 00 

ACATCCATTC TATTCATTTC AgTGGACATG TGTTCACTGT ACGAAAAAAA GAGGAGTATA 33 60 

AAATGGCACT GTACAATCTC TATCCAGGTG TTTTTGAGAC AGTGGAAATG TTACCATCCA 342 0 

AAGCTGGAAT TTGGCGGGTG GAATGCCTTA TTGGCGAGCA TCTACATGCT GGGATGAGCA 34 8 0 

cactttttct ggtgtacagc aaVtaagtgtc AGACTCCCCT GGGAATGGCT TCTGGACACA 3 540 

TTAGAGATTT TCAGATTACA GCTTCAGGAC AATATGGACA GTGGGCCCCA AAGCTGGCCA 3 600 

GACTTCATTA TTCCGGATCA AtAaATGCCT GGAGCACCAA GGAGCCCTTT TCTTGGATCA 3 660 

AGGTGGATCT GTTGGCACCA ATgWtATTC ACGGCATCAA GACCCAGGGT GCCCGTCAGA 372 0 

AGTTCTCCAG CCTCTACATC TCT^ACTTTA TCATCATGTA TAGTCTTGAT GGGAAGAAGT 37 80 

ggcagactta tcgaggaaat tccA:tggaa CCTTAATGGT CTTCTTTGGC AATGTGGATT 3 840 

CATCTGGGAT AAAACACAAT ATTTTTAACC CTCCAATTAT TGCTCGATAC ATCCGTTTGC 3 900 

ACCCAACTCA TTATAGCATT CGCAGCACTC TTCGCATGGA GTTGATGGGC TGTGATTTAA 3 960 

ATAGTTGCAG CATGCCATTG GGAATCGAGA GTAAAGCAAT ATCAGATGCA CAGATTACTG 4 02 0 

CTTCATCCTA CTTTACCAAT ATGTTTGCCA CCTGGTCTCC TTCAAAAGCT CGACTTCACC 4080 

TCCAAGGGAG GAGTAATGCC TGGAGM:CTC AGGTGAATAA TCCAAAAGAG TGGCTGCAAG 4140 

TGGACTTCCA GAAGACAATG AAAGTCACAG GAGTAACTAC TCAGGGAGTA AAATCTCTGC 42 0 0 

TTACCAGCAT GTATGTGAAG GAGTTCCfTCA TCTCCAGCAG TCAAGATGGC CATCAGTGGA 42 60 

CTCTCTTTTT TCAGAATGGC AAAGTAAAGG TTTTTCAGGG AAATCAAGAC TCCTTCACAC 43 2 0 

CTGTGGTGAA CTCTCTAGAC CCACCGT'mC TGACTCGCTA CCTTCGAATT CACCCCCAGA 43 RO 

GTTGGGTGCA CCAGATTGCC CTGAGGATGG AGGTTCTGGG CTGCGAGGCA CAGGACCTCT 444 0 

ACTGAGGGTG GCCACTGCAG CACCTGCCAC TGCCGTCACC TCTCCCTCCT CAGCTCCAGG 45 00 

GCAGTGTCCC TCCCTGGCTT GCCTTCTACC TTTGTGCTAA ATCCTAGCAG ACACTGCCTT 45 60 

GAAGCCTCCT GAATTAACTA TCATCAGTCa TGCATTTCTT TGGTGGGGGG CCAGGAGGGT 462 0 

GCATCCAATT TAACTTAACT CTTACCTATt\ TTCTGCAGCT GCTCCCAGAT TACTCCTTCC 46 80 

TTCCAATATA ACTAGGCAAA AAGAAGTGAG GAGAAACCTG CATGAAAGCA TTCTTCCCTG 47 40 

AAAAGTTAGG CCTCTCAGAG TCACCACTTC ffTCTGTTGTA GAAAAACTAT GTGATGAAAC 4800 

TTTGAAAAAG ATATTTATGA TGTTGCGGCC GC 4 83 2 

(2) INFORMATION FOR SEQ ID NO: 47: \ 
(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 14 5 7 amino aci\ds 

(B) TYPE: amino acid \ 
{C) STRANDEDNESS: single \ 
(D) TOPOLOGY: linear \ 

(ii) MOLECULE TYPE: protein \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Gin lie Glu Leu Ser Thr Cys Ptte Phe Leu Cys Leu Leu Arg Phe 



Cys Phe Ser Ala Thr Arg Arg Tyr Tyr\ Leu Gly Ala Val Glu Leu Ser 
20 25 \ 30 

Trp Asp Tyr Met Gin Ser Asp Leu Gly CBlu Leu Pro Val Asp Ala Arg 
35 40 \ 45 

Phe Pro Pro Arg Val Pro Lys Ser Phe pio Phe Asn Thr Ser Val Val 
50 55 \ 60 

Tyr Lys Lys Thr Leu Phe Val Glu Phe ThA Asp His Leu Phe Asn lie 
65 70 \lS 80 

\ 

V 



Ala Lys Pro Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser 
100 105 110 

His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 135 140 

Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu 
145 150 155 160 

Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 
165 170 175 

Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie 
180 185 190 

Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 

Gin Thr Leu His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly 
210 215 220 

Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 230 235 240 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 
245 250 255 

Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val 
260 265 270 

Tyr Trp His Val lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie 
275 280 285 

Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 
290 295 300 

Leu Glu lie Ser Pro lie Thr Phe Leu Thr Ala Gin Thr Leu Leu Met 
305 310 315 320 

Asp Leu Gly Gin Phe Leu Leu Phe Cys His lie Ser Ser His Gin His 
325 330 335 

Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
340 345 350 

Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 



Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 

370 375 380 



Pro Ser Phe 
385 

Trp Val His 



Leu Val Leu 



Asn Gly Pro 
435 

Ala Tyr Thr 
450 

Ser Gly lie 
465 




Arg Ser Val Ala 



Gin Arg I^e 



Asp Glu Th:^ 



Ala Glu 



Asp Arg 



Gly Arg 
440 



Glu Glu 
410 

Ser Tyr 
425 



Lys Lys His Pro Lys Thr 
395 400 

Asp Trp Asp Tyr Ala Pro 
415 

Lys Ser Gin Tyr Leu Asn 
430 



Lys Tyr Lys Lys Val Arg Phe Met 
445 



Phe Lys 
455 



Leu 



Gly Pro 
470 



Thr Arg 
Leu Leu Tyr Gly 



Glu Ala lie Gin His Glu 
460 

Glu Val Gly Asp Thr Leu 
475 480 



Leu lie lie Phe 



Lys Asn din Ala Ser Arg Pro Tyr Asn lie Tyr Pro 
485 \ 490 495 



His Gly lie Thr Asp Val Arb Pro Leu Tyr 
500 \ 505 



Ser Arg Arg Leu Pro Lys 
510 



Gly Val Lys 
515 



His Leu Lys Asp\Phe 
\52 0 



Pro lie Leu Pro Gly Glu lie Phe 
525 



Lys Tyr Lys 
530 



Trp Thr Val Thr Mai 
535 



Glu Asp Gly Pro Thr Lys Ser Asp 
540 



Pro Arg Cys Leu Thr Arg Tyr T^^ Ser Ser Phe Val Asn Met Glu Arg 
545 550 \ 555 560 



Asp Leu Ala 
Ser Val Asp 



lie Leu Phe 
595 



Ser 



Gin 
580 



Gly Leu lie Glyl 
565 

Arg Gly Asn Gin 



Ser Val Phe Asp Glu 
600 



Pro Leu Leu lie Cys Tyr Lys Glu 

570 575 

□.e Met Ser Asp Lys Arg Asn Val 

5^5 590 

As\i Arg Ser Trp Tyr Leu Thr Glu 
605 



Asn lie Gin 
610 

Pro Glu Phe 
625 

Phe Asp Ser 
Tyr lie Leu 



Arg Phe Leu Pro Asn 
615 

Gin Ala Ser Asn lie 
630 



Leu 



Ser 
660 



Gin Leu Ser Val 
645 

lie Gly Ala Gin 



Pro\Ala Gly Val Gin Leu Glu Asp 

620 

Met }\is Ser lie Asn Gly Tyr Val 

635 640 

Cys Ldu His Glu Val Ala Tyr Trp 
650 655 

Thr Asd Phe Leu Ser Val Phe Phe 
665 \ 670 



Ser Gly Tyr 
675 



Thr Phe Lys His Lys 
680 



Met Val \Tyr Glu Asp Thr Leu Thr 
685 



Leu Phe 
690 

Gly Leu 
705 



Pro pke 



Ser Gly Glu Thr 
695 



Trp lie. Leu Gly Cys His 
710 



Val Phe Met Ser Met Glu Asn Pro 
700 

Asn Ser Asp Phe Arg Asn Arg Gly 

715 720 



Met Thr Ala Leu 



Leu Lys Val Ser 
725 



Ser Cys Asp Lys Asn Thr Gly Asp 
730 735 



Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys 
740 ' 745 750 



Asn Asn Ala lie Glu ^ro Arg Ser 
755 760 



Phe Ser Gin Asn Pro Pro Val Leu 
765 



Lys Arg 
770 

Glu Glu 
785 

Asp Phe 
Gin Lys 



His Gin 
lie Asp 
Asp lie 



Lys Thr 
820 



Arg GlVi lie Thr 

\ 775 



Arg Thr Thr Leu Gin Ser Asp Gin 
780 



Tyr Asp -Asp Thr lie Ser Val Glu Met Lys Lys Glu 

795 800 



790 



Tyr Asp Glu Asp 
805 



Glu Asn Gin Ser Pro Arg Ser Phe 
810 815 



Arg His Tyi\ Phe lie Ala Ala Val Glu Arg Leu Trp 
82 5 83 0 



Asp Tyr 



Ser Gly 
850 

Asp Gly 
865 



Gly Met 
835 



Ser Ser Ser Pro 
8Vo 



\ 

Ser Val Pro Gin Phe Ly^ 
855 

Ser Phe Thr Gin Pro Leu 
870 



Leu Gly Leu Leu 



Gly Pro Tyr lie 

885 



His Val Leu Arg Asn Arg Ala Gin 
845 

Lys Val Val Phe Gin Glu Phe Thr 
860 

r Arg Gly Glu Leu Asn Glu His 
875 880 

Ar^ Ala Glu Val Glu Asp Asn lie 
\ 890 895 




Met Val 



Ser Leu 



Lys Asn 
930 



Thr Phe 
900 

He Ser 
915 



Phe Val Lys Pro Asn Glu 
935 




Arg Asn Gin Ala Ser Wg Pro Tyr Ser Phe Tyr Ser 
905 \ 910 



Tyr Glu Glu Asp 
920 



Gin Ai^ Gin Gly Ala Glu Pro Arg 
925 

Thr Lys \Thr Tyr Phe Trp Lys Val 
940 



Gin His His Met Ala Pro Thr Lys 
945 950 



Asp Glu Phe Asp Cys Lys Ala Trp 
955 960 



Ala Tyr 
He Gly 



Phe Ser Asp Val Asp Leu 
965 



Glu Lys AspVVal His Ser Gly Leu 
970 \ 975 



Pro Leu Leu Val Cys His Thr Asn Thr L^u Asn Pro Ala His 

980 985 \ 990 



Gly Arg GlA Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr lie Phe 
995\ 1000 1005 

Asp Glu Thr Ws Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys 
1010 \ 1015 1020 

Arg Ala Pro C^s Asn lie Gin Met Glu Asp Pro Thr Phe Lys Glu Asn 
1025 \ 1030 1035 1040 

Tyr Arg Phe His\ Ala He Asn Gly Tyr He Met Asp Thr Leu Pro Gly 
\l045 1050 1055 

Leu Val Met Ala Gin Asp Gin Arg He Arg Trp Tyr Leu Leu Ser Met 
106o\ 1065 1070 

Gly Ser Asn Glu Ain He His Ser He His Phe Ser Gly His Val Phe 
1075 \ 1080 1085 

\ 

Thr Val Arg Lys LyA Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr 
1090 \ 1095 1100 



Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly He 
1105 \llO 1115 1120 

Trp Arg Val Glu Cys L^u He Gly Glu His Leu His Ala Gly Met Ser 
1125 \ 1130 1135 

Thr Leu Phe Leu Val Tyr\ Ser Asn Lys Cys Gin Thr Pro Leu Gly Met 
1140 \ 1145 1150 



Ala Ser Gly H 
1155 



is He Arg \< 



Arg Asp Phe Gin He Thr Ala Ser Gly Gin Tyr 



1160 



1165 



Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He 
1170 llTiS 1180 

Asn Ala Trp Ser Thr Lys Glu \Pro Phe Ser Trp He Lys Val Asp Leu 
1185 1190 \ 1195 1200 

Leu Ala Pro Met He He His cVy He Lys Thr Gin Gly Ala Arg Gin 
1205 \ 1210 1215 

Lys Phe Ser Ser Leu Tyr He SeA Gin Phe He He Met Tyr Ser Leu 
1220 \ 1225 1230 

Asp Gly Lys Lys Trp Gin Thr Tyr krg Gly Asn Ser Thr Gly Thr Leu 
1235 124o\ 1245 

Met Val Phe Phe Gly Asn Val Asp S^r Ser Gly He Lys His Asn He 
1250 1255 \ 1260 

Phe Asn Pro Pro He He Ala Arg Tyi He Arg Leu His Pro Thr His 
1265 1270 \ 1275 1280 



Tyr Ser He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu 
1285 \290 1295 



Asn Ser Cys \Ser Met Pro Leu Gly Met Glu Ser Lys Ala lie Ser Asp 
L300 1305 1310 

Ala Gin lie Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp 
1315 A 1320 1325 

Ser Pro Ser Ly^ Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp 
1330 \ 1335 1340 

Arg Pro Gin Val VAsn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin 
1345 \ 1350 1355 1360 

Lys Thr Met Lys \^1 Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu 
J65 1370 1375 

Leu Thr Ser Met Tyx Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp 
1380 \ 1385 1390 

Gly His Gin Trp Thr\ Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe 
1395 \ 1400 1405 

Gin Gly Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro 
1410 \ 1415 1420 

Pro Leu Leu Thr Arg T^Ar Leu Arg lie His Pro Gin Ser Trp Val His 
1425 14?0 1435 1440 

Gin lie Ala Leu Arg Met\Glu Val Leu Gly Cys Glu Ala Gin Asp Leu 
1445 \ 1450 1455 

Tyr 



[2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino V^ids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : singl\ 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ \ ID NO: 48: 

Arg Gly Met Thr Ala Leu Leu Lyi Val Ser Ser Cys Asp Lys Asn Thr 
1 5 \ 10 15 

Gly Asp Tyr Tyr Glu Asp Ser Tyr blu Asp lie Ser Ala Tyr Leu Leu 
20 IS 30 

Ser Lys Asn Asn Ala lie Glu Pro AVrg Ser Phe Ser Gin Asn Ser Arg 
35 40 \ 45 

His Pro Ser Thr Arg Gin Lys Gin Phfe Asn Ala Thr Pro Pro Thr Pro 
50 55 \ 60 



Pro Thr Pro Pro Val Leu Lys Arg His\Gln Arg Glu He Thr Arg Thr 
65 70 \ 75 80 



Thr Leu qln Ser Asp Gin Glu Glu lie Asp Tyr Asp Asp Thr lie Ser 
85 90 95 

Val Glu MeV Lys 
100 



(2) 



INFORMATION 
(i) 



TOR SEQ ID NO:49 : 



STRi^DEDNESS : single 
TOPOLOGY: linear 



SEQUENC^ CHARACTERISTICS: 

(A) LEwfcTH: 3 00 base pairs 

(B) TYPfe: nucleic acid 
(C) 
(D) 

(ii) MOLECULE "tYPE : DNA (genomic) 
(xi) SEQUENCE AeSCRIPTION: SEQ ID NO: 49 
AGAGGCATGA CCGCCTTMTT GAAGGTTTCT AGTTGTGACA 
GAGGACAGTT ATGAAGAtWt TTCAGCATAC TTGCTGAGTA 
AGAAGCTTCT CCCAGAATTC TAGACACCCT AGCACTAGGC 
CCTCCTACAC CACCAACCCC ACCAGTACTG AAACGCCATC 
ACTCTTCAGT CTGATCAAGA GGAAATTGAC TATGATGATA 



AGAACACTGG 
AAAACAATGC 
AAAAGCAATT 
AACGGGAAAT 
CCATATCAGT 



TGATTATTAC 
CATTGAACCA 
TAATGCCACC 
AACTCGTACT 
TGAAATGAAG 



60 
120 
180 
240 
300 



(2) INFORMATION FOR 3EQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: \27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDl^ESS : single 

(D) TOPOLOGY :\ linear 
(ii) MOLECULE TYPE : \protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Ser Arg His Pro Ser 
1 5 



rhr Arg Gin Lys Gin Phe Asn Ala Thr Pro Pro 
10 15 



Val Leu Lys Arg His 
20 



Gtn Arg Glu lie Thr Arg 

25 



(2) INFORMATION FOR SEQ ID N(i:51: 
(i) SEQUENCE CHARACTERI3rriCS : 

(A) LENGTH: 81 base\ pairs 

(B) TYPE: nucleic aaid 

(C) STRANDEDNESS : si^^gle 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (g 
(xi) SEQUENCE DESCRIPTION: 

TCGCGACACC CTAGCACTAG GCAAAAGC 
CATCAACGGG AAATAACGCG T 



omic ) 

EQ ID NO: 51; 
TTTAATGCCA CCCCACCAGT CCTGAAACGC 



60 
81 



(2) INFORMATION FOR SEQ ID NO: 52 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH; 27 base paii^s 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single\ 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomiJ 
(xi) SEQUENCE DESCRIPTION: SEQ 



[D NO: 52 : 



ACTACTCTTC AATCTGATCA AGAGGAA 



27 



(2) INFORMATIOta FOR SEQ ID NO: 53: 



(i) 



(ii) 
(xi) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) aTRANDEDNESS : single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA {genomic} 
SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CGCGCCGCTC GAGTQTACAA TGGCTTTGCC TTTTGCTTTA CTG 



43 



(2) INFORMATION FOR SEQ ID NO: 54: 



(i) 



(ii) 
(xi) 



SEQUENCt: CHARACTERISTICS: 

(A) LEMfGTH: 43 base pairs 

(B) TYHE: nucleic acid 

(C) STiWdEDNESS: single 

(D) TOPOLOGY: linear 
molecule! TYPE: DNA (genomic) 
SEQUENCE \dESCRIPTION: SEQ ID NO: 54: 



GCGCCCATCG ATTTATltCT TCCTCCTTAA CCTTTCTTGC AAG 



43 



(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE QHARACTERISTICS : 

(A) LENGT^: 48 base pairs 

(B) TYPE:\nucleic acid 

(C) STRANffiEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CGCGCCGCTC GAGCATCCcA ATGGCCCTGT CCTTTTCTTT ACTGATGG 



48 



(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 9 base pairs 

(B) TYPE: nutleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY 1 linear 

(ii) MOLECULE TYPE:! DNA (genomic) 
(xi) SEQUENCE DESCRITPTION : SEQ ID NO: 56: 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 



(2) INFORMATION FOR SEQ \lD NO: 57: 
(i) SEQUENCE CHARACtTERISTICS : 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucliic acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: llinear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57; 
CCGGATCCTC TACAATGGCT TTGQCTTTTG CTTTACTG 



39 



(2) INFORMATION FOR SEQ ID\NO:58: 
(i) SEQUENCE CHARACTEltsTICS : 

(A) LENGTH: 46 bake pairs 

(B) TYPE: nucleic lacid 



(C) STRANDfeDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DEicRIPTION; SEQ ID NO: 58: 

CGCGCCGGCG GCCGCTTATT CCTTCCTCCT TAACCTTTCT TGCAAG 



46 



INFORMATION FOR 
(i) SEQUENCE CHA! 
(A) LENGTH; 
(B) 
(C) 
(D) 



:Q id NO: 59 : 

CTERISTICS: 
43 base pairs 
TYPE: niicleic acid 
STRANDEIX^ESS : single 
TOPOLOGyI: linear 



DNA (genomic) 



(ii) MOLECULE TYPE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CCGGATCCCA TCCCAATGGC CCTGTCCTTT TCTTTACTGA TGG 

(j) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 base pairs 

(B) TYPE: nuc 

(c) strandedn: 

(D) TOPOLOGY: 
MOLECULE TYPE: 



43 



Leic acid 
ESS: single 
linear 

DNA (genomic) 



(11] 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 60: 
CGCGCCGGCG GCCGCTCAAT CqTTCCTCCT TAATCTTTTT TGCAAG 



46 



(J) INFORMATION FOR SEQ tD NO: 61: 
!i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90lbase pairs 

(B) TYPE: nucldic acid 

(C) STRANDEDNESB : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DWA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
AGCTTGCTGT TTGTGTGCTG CCTOTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 
TCCCTAAAAT GGGCAAACAT TGCaKgCAGC 



60 
90 



(j) INFORMATION FOR SEQ ID \ NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bise pairs 

(B) TYPE: nucleicl acid 

(C) STRANDEDNESS:\ single 

(D) TOPOLOGY: liniar 
(ii) MOLECULE TYPE: DNA W genomic) 
(xi) SEQUENCE DESCRIPTIctj : SEQ ID NO: 62: 

AAACAGCAAA CACACAGCCC TCCCTCpCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 
ACCTCTCTGA 



60 
70 



(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base! pairs 

(B) TYPE: nucleic aAid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear\ 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: ISEQ ID NO: 63: 



AGCTTCAGAG AGGTCTCTGA CCTCTGCCC 



AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 



60 



TGTGTTTGCT GTTTGCTGCT TG 82 

(2) INFORMATOON FOR SEQ ID NO: 64: 
(i) SEQOENCE CHARACTERISTICS: 
(A); LENGTH: 78 base pairs 
{B)\ TYPE: nucleic acid 

(C) \STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLEC^JLE TYPE; DNA (genomic) 

(xi) SEQUEt^CE DESCRIPTION: SEQ ID NO: 64: 
CAATGTTTGC CCAI^TTTAGG GACATGAGTA GGCTGAAGTT TGTTCAGTGT GGACTTCAGA 60 
GGCAGCACAC AAAC^GCA 7 8 

(2) INFORMATION \fOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYRE: nucleic acid 

(C) STr\^DEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE WpE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AATTCGCTGT TTGTGTGC\TG CCTCTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 60 
TCCCTAAAAT GGGCAAACAT TGCAAGCAGC 9 0 

\ 

(2) INFORMATION FOR 'iSEQ ID NO:66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHi 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEpNESS : single 

(D) TOPOLOGt: linear 

(ii) MOLECULE TYPE\: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
AAACAGCAAA CACACAGCCC TfCCCTGCCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 60 
ACCTCTCTGG 7 0 

(2) INFORMATION FOR SEq\iD NO: 67: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 ' base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES'g : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DIfA (genomic) 

(xi) SEQUENCE DESCRI PINION: SEQ ID NO: 67: 
AATTCCAGAG AGGTCTCTGA CCTC^GCCCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 60 
TGTGTTTGCT GTTTGCTGCT TG ' 82 



(2) INFORMATION FOR SEQ ID 1^0:68: 
(i) SEQUENCE CHARACTERHSTICS : 

(A) LENGTH: 78 basW pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 68: 
CAATGTTTGC CCATTTTAGG GACATGACjTA GGCTGAAGTT TGTTCAGTGT GGACTTCAGA 6 0 

GGCAGCACAC AAACAGCG V 7 8 



(2) INFORMATION FOR SEQ ID NO: 69: 
( i ) SEQUENCE \cHARACTERISTICS : 

(A) LENQTH: 33 base pairs 

(B) TYPEt nucleic acid 

(C) STRAffiDEDNESS : single 

(D) TOPODjOGY: linear 

(ii) MOLECULE T\YPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CGCGCCGCCC GGGGTAGArC TTGCTACCAG TGG 



33 



(2) INFORMATION FOR 1 SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 40 base pairs 

(B) TYPE: hucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYRE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GCGCCCGCGG CCGCCACTGT \cCCAGGTCAG TGGTGGTGCC 



40 



(2) INFORMATION FOR SED ID NO: 71: 



(i) 



(ii) 
(xi) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucILeic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: \linear 
MOLECULE TYPE: DNA (genomic) 
SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 



46 



(2) INFORMATION FOR SEQ it) NO: 72: 
(i) SEQUENCE CHARACT^IRI STICS : 

(A) LENGTH: 43 Ifcase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGY: liAear 
(ii) MOLECULE TYPE: DNi^ (genomic) 
(xi) SEQUENCE DESCRI PTlt)N : SEQ ID NO: 72: 

GCGCCCATCG ATTCATTCCT TACTT^TTAA ACTTTCTTGC AAG 



43 



(2) INFORMATION FOR SEQ ID NO: 73: 



(i) 



111] 
(xi) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 basi pairs 

(B) TYPE: nucleic abid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: lineal 
MOLECULE TYPE: DNA (denomic) 
SEQUENCE DESCRIPTION:! SEQ ID NO: 73: 



CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 



46 



(2) INFORMATION FOR SEQ ID NO : 1 4 : 
(i) SEQUENCE CHARACTERISTKTS : 

(A) LENGTH: 43 base p^irs 

(B) TYPE: nucleic acic 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA {genomic} 

(xi) SEQUENCE bESCRIPTION: SEQ ID NO: 74: 

GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 43 

\ 

\ 

(2) INFORMATION F0R\ SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTpi: 6 amino acids 

(B) TYPE: Hmino acid 

(C) STRANDfiDNESS : single 

(D) TOPOLodY; linear 
(ii) MOLECULE TYPp : peptide 

(xi) SEQUENCE DEStRIPTION: SEQ ID NO: 75: 
Asn Ser Arg His bro Ser 

1 i 



(2) INFORMATION FOR S^Q ID NO: 76: 
(i) SEQUENCE CHAI^CTERISTICS : 

(A) LENGTH: tL4 amino acids 

(B) TYPE: amino acid 

(C) STRANDED^ESS : single 

(D) TOPOLOGY:: linear 
(ii) MOLECULE TYPE:^ protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gin Asn Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thr 
1 5 \ 10 

I 

(2) INFORMATION FOR SEQ tD NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 b&se pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTI.ON : SEQ ID NO:77: 
AATTCGCGAC ACCCTAGC \ 18 

(2) INFORMATION FOR SEQ ID 1^0: 78: 
(i) SEQUENCE CHARACTERl'STICS : 

(A) LENGTH: 42 bas^ pairs 

(B) TYPE: nucleic dcid 

(C) STRANDEDNESS : s'4.ngle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION:, SEQ ID NO: 78: 
CAAAACCCAC CAGTCTTGAA ACGCCATCAA CGGGAAATAA CG 42 



(2) INFORMATION FOR SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base phirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GCGCTGTGGG ATCGGTTTTG GGTGGTCAGA AC 3 2 



(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGtTH; 24 base pairs 

(B) TYPEt nucleic acid 

(C) STRAioEDNESS : single 

(D) TOPOliOGY: linear 

(ii) MOLECULE tWpE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

tttgcggtag TTGCCCTTTA TTGC 24 

\ 

(2) INFORMATION FOR IsEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHt 6 amino acids 

(B) TYPE: Amino acid 

(C) STRANDHDNESS : single 

(D) TOPOLOOY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



Arg Thr Leu Gin aer Asp 

1 5^, 




\ 

(2) INFORMATION FOR SEQ, ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1^ base pairs 

(B) TYPE: nucl^^eic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: liWA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

CGTACTCTTC AGTCT 15 

\ 

INFORMATION FOR SEQ lEi'. NO:83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bdse pairs 

(B) TYPE: nucleic, acid 

(C) STRANDEDNESS : .single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GCATGAGAAG TCAGACTAG \ 19 



